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ABSTRACT 

Understanding the mechanisms that lead dense environments to host galaxies with redder col- 
ors, more spheroidal morphologies, and lower star formation rates than field populations remains an 
important problem. As most candidate processes ultimately depend on host halo mass, accurate char- 
acterizations of the local environment, ideally tied to halo mass estimates and spanning a range in 
halo mass and redshift are needed. In this work, we present and test a rigorous, probabalistic method 
for assigning galaxies to groups based on precise photometric redshifts and X-ray selected groups 
drawn from the COSMOS field. The groups have masses in the range 10^'^ < M2ooc/Mq < 10^'^ 
and span redshifts < 2 < 1. We characterize our selection algorithm via tests on spectroscopic 
subsamples, including new data obtained at the VLT, and by applying our method to detailed mock 
catalogs. We find that our group member galaxy sample has a purity of 84% and completeness of 
92% within 0.5i?2ooc- We measure the impact of uncertainties in redshifts and group centering on the 
quality of the member selection with simulations based on current data as well as future imaging and 
spectroscopic surveys. As a first application of our new group member catalog which will be made 
publicly available, we show that member galaxies exhibit a higher quenched fraction compared to the 
field at fixed stellar mass out to z ^ 1, indicating a significant relationship between star formation 
and environment at group scales. We also address the suggestion that dusty star forming galaxies 
in such groups may impact the high-£ power spectrum of the cosmic microwave background and find 
that such a population cannot explain the low power seen in recent SZ measurements. 

Subject headings: catalogs ~ galaxies: groups: general - galaxies: star formation 



|mgeorge@astro.berkeley.edul 

^ Department of Astronomy, University of California, Berke- 
ley CA 94720, USA 

Lawrence Berkeley National Laboratory, 1 Cyclotron Road, 
Berkeley CA 94720, USA 

^ Berkeley Center for Cosmological Physics, University of Cal- 
ifornia, Berkeley CA 94720, USA 

* Max-Planck-Institut fiir Extraterrestrische Physik, Giessen- 
bachstrafie, 85748 Garching, Germany 

^ University of Maryland Baltimore County, 1000 Hilltop Cir- 
cle Baltimore, MD 21250, USA 

° Center for Cosmology and Particle Physics, Department of 
Physics, New York University, 4 Washington Place, New York, 
NY 10003, USA 

Institute for the Physics and Mathematics of the Universe, 
The University of Tokyo, Kashiwa, Chiba 277-8568, Japan 

® Institute of Astronomy & Astrophysics, Academia Sinica, 
Taipei, Taiwan 

^ University of Paris Denis Diderot, 75205 Paris CEDEX 13, 
France 

10 GEPI, Observatoire de Paris, Section de Meudon, 92195 
Meudon CEDEX, France 

11 Laboratoire d'Astrophysique de Marseille, CNRS Univer- 
site de Provence, 38 rue F. Joliot-Curie, 13388 Marseille Cedex 
13, France 

1^ Service d'Astrophysique, CEA-Saclay, Orme de Merisiers, 
Bat. 709, 91191 Gif-sur-Yvette, France 

Kavli Institute for Particle Astrophysics and Cosmology; 
Physics Department, Stanford University; and SLAG National 
Accelerator Laboratory, Stanford CA 94305, USA 

Institute for Theoretical Physics, University of Zurich, 8057 
Zurich, Switzerland 

1^ Spitzer Science Center, 314-6 Caltcch, 1201 East California 
Boulevard, Pasadena, CA, 91125, USA 

1^ European Southern Observatory, Karl-Schwarzchild-str. 2, 



85748 Garching, Germany 

^'^ Universita di Napolia Federico II, Dipartimento di Sciennze 
Fisiche and INAF Observatorio Astronomico di Capodimonte, 
V. Moiariello 16, 80131 Napoh, Italy 

1* Laboratoire d'Astrophysique, Ecole Polytechnique Federale 
de Lausanne (EPFL), Observatoire de Sauverny, 1290 Versoix, 
Switzerland 

1^ Astrophysical Observatory, City University of New York, 
College of Staten Island, 2800 Victory Blvd., Staten Island, NY 
10314, USA 

^0 Space Telescope Science Institute, 3700 San Martin Drive, 
Baltimore, MD 21218, USA 

21 Institut d'Astrophysique de Paris, UMR 7095, 98 bis Boule- 
vard Arago, 75014 Paris, France 



2 



George et al. 



1. INTRODUCTION 

Galaxies in dense cluster regions have long been known 
to have different characteristics than counterparts in 
the field, with redder colors, a greater tendency for 
spheroidal morphologies, and suppressed star formation 
rates. Dense clusters are also the sites of the most mas- 
sive and luminous galaxies. Much effort has been made to 
find the redshift, halo mass, and cluster-centric distance 
at which these distinctions between galaxy populations 
are imprinted an d the process by which these transforma- 
tions oc cur (e.g., I 0emler|1974" Dressler|1980 Butcher &J 
Oemler|[T9"8 4 Dress ier et ai. 11 19971 [Poggianti e t al. 1999; 
Lewis et al . 2002; Goto et al. 2003' Balogh et al. 2004; De 
Propris et al . 2004; Kauffmann ct al. 2004; Lin ct al. 2004. 
Bianton et~al.||2005) |Cucciati et al.||2 006 Cooper et at. 
20061 IWeinmann et al ||2006| [CapaF et al. 20 07a, |Gerke 



et ai. 



2009 



1 



20071 IBlanton fc Moustakas||2009; Hansen et al 



ei et al.||2009[ [Feruglio erar]|2010^ . WEUe" mas 



sive clusters present clear examples of galaxy transfor- 
mations due to gas stripping, merger ac tivity, and tidal 
disruption (e.g ., |Kenney et al.||1995; ^Gavazzi et al.||200T 
Cortese et al. 2007[ ), the exteiit to which these processes 
attect the majority of galaxies which live in less dense 
environments is uncertain. Extending cluster samples to 
groups with lower halo masses and higher redshifts is 
challenging because it requires significant observational 
expenditures and careful analysis to isolate such environ- 
ments from the field. 

Recent analyses at low redshift have confirmed the ex- 
istence of an environmental dependence of galactic struc- 
ture and colors across a range of environm e nts (e.g., 
Kauffmann et al.| [20041 [Baldry et a"L]|2006l |Bamford| 



et al. 2009p . The corresponding picture at z ~ 1 has 
been less clear. With pointed observations around high- 
redshift galaxy clusters, several studies have found sig- 
nificant trends in morphology, colo r, and star-forrnation 
rate with local galaxy density (e.j^ ., Postman et al.|2005 



Smith et all [20051 |Tanaka et ar|[2(T05 ; P oggianti et al. 



2008p . However, some find that the relations disappear 
m stellar mass-selected samples, arguing that environ- 
mental trends are due to differences in the stellar mass 
distribution between environments rath er than physical 



Poggianti et al. 



proce sses acting in dense regions (e.g 
2008| . 

In field surveys reaching z ~ 1, results fr om the 
VIRM OS-VLT Deep Survey (VVD S; Scodeggio et al. 



2009 



and zCOSMOS 



lovino et al. 201(FKovac et al 



Tasca et al.^'^OOg 



Cucciati et al 
20lOat 



show little 

or no environmental innuence on morphology and color 
especially at high stellar masses (log(M^/MQ) > 10.7), 



while results from DEEP2 dGoopcr ct al. 2010) and oth 



ers from zCOSMOS ( jPeng et al. |2010) show a clear rela- 
tionship between color and environment. These papers 
generally find weakening environmental trends with in- 
creasing redshift, but diffe r in the red shift at which the 
trends disappear. Cooper et al. |2007) and Cooper et al 



( [2010| ) discuss the discrepancies m environmental trends 
seen m high-redshift field surveys and suggest that the 
non-detection by some studies could be due to the use 
of less confident spectroscopic redshifts and lower sam- 
pling rates, as well as increased difficulty with determin- 
ing environmental de nsities using optica l spectroscopy 
at high redshift, while Peng et al. (2010j) attributes the 



differences to the definitions used to characterize envi- 
ronments. 

The aim of this work is to define a clean sample of 
galaxies in dense group environments out to redshift 
2; = 1 to address these issues. We study groups from 
the COSMOS survey that have been identified as sources 



of extended X-ray emission (Finoguenov et al. 2007 



and in prep.), which is a strong indication that they 
are virialized structures and not chance associations of 
galaxies. The groups have halo masses in the range 
10^^ < M2ooc/M c7) 5. 10^"^ as determined by weak lens- 
ing ( Leauthaud et aL||20iO| ). In a companion paper, we 



describe weak lensing tests to optimize the identification 
of halo centers (Paper II; George et al., in prep.). We 
select member galaxies based on photometric redshifts 
derived from extensive multi-wavelength imaging, which 
provides a much greater sampling density than existing 
spectroscopic surveys. Using a spectroscopic subsample 
and mock catalogs, we carefully evaluate our member 
selection for potential biases or contamination, and ac- 
count for photometric redshift uncertainties. This robust 
sample of group members can be used to address unset- 
tled questions about the link between galaxies and their 
environments. 

A key challenge is to disentangle the intrinsic and ex- 
trinsic factors that may play a role in shaping galaxy 
properties. For instance, galaxies in dense regions have a 
higher characte ristic stellar mass than in less dense envi- 
ronments (e.g., Baldry ct al. 2006), so a morphology-mass 
relation could be confiatcd with a morphology-density re- 
lation. Since stellar mass plays an important role in de- 
termining galaxy properties, and mass-to-light ratios are 
strongly affected by star formation activity, recent envi- 
ronmental studies have stressed the use of stellar mass- 
selected samples rather than luminosity-selected sam ples 
to make a fair comparison across environments (e.g.. 



van 



der Wei et al||2007l [Scodeggio et all[2009l [Cooper et al 



2010F 



In addition to controlling for intrinsic galaxy proper- 
ties in these studies, defining and measuring the "en- 
vironment" presents another problem. The distance to 
the iVth nearest neighbor or the mean density of galax- 
ies inside a fixed radiu s are cornmonl y used as environ- 
mental indicators (e.g., Dresslerjl980 1. Kauffmann et al. 
([2004 ) show that galaxy properties correlate most tightly 
with local density on scales below ~ 1 Mpc and are 
uncorrelated with the density on larger scales once the 
small-scale density is fixed. Several studies have shown 
evidence that galaxy properties correlate most tightly 
with density within their halo, and have emphasized that 
the aperture used for comparing equivalent regions must 
scale with halo mass to avoid confusion between local 



and global densities (Hansen ct al. 2005; Wcinmann et al 
[2006,, Bianton fc Bciiind^2007, Haas ct ai. 201l| . Insteac 
ot using the galaxy density field to define environment , 
one can define a catalog of galaxy groups and clusters 
and study their properties as a function of halo mass 
and group-centric distance. In this paper we use the 
term "group" to denote a set of galaxies with a common 
dark matter halo and to emphasize the low mass range 
studied, making no formal distinction between groups 
and clusters. 

Catalogs of galaxy groups have been constructed from 
both optical surveys identifying galaxy overdensities and 
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X-ray surveys detecting the hot gas trapped by deep 
gravitational potentia ls. Optical catalogs o ften employ 
matched filte rs (e.g., Postman et al. 1996[) and tesse- 
lations (e.g., |Marinoni et al.||2UU2| |Gerke et al.||2005 l 
to isolate groups from the background held, and red 
sequence methods have proven efficient at identifying 



groups over large v olumes (e.g., Gladders & Yee 2005_ 
Koester et al.||200"7 ). These catalogs typically assign the 
brightest mernber galaxy as the center of each group, and 
use the richness determined by the number of members 
as a proxy for the total mass. X-ray detections can re- 
duce the likelihood of projection effects, improve mass 
estimates, help with the determination of group centers, 
and shed light on the interpla y between the hot gas and 



stellar cont ent of groups (e.g ., 'Mulc haey et al.| 2003 
et al.||2004j[Finoguenov et al . 2007; Sun et al.|f2 nug| 



Lin 



(Jur understanding of group properties has benefited 
from small , well-s tudied samples at low redshift (e.g., 
Mulchaey fc Zablu doffll99"8ljZabludoff fc Mulchaey|1998l 
Tran et ar||2001j |Sun et al.|j2009[ ) along with larger sta- 
tistic al samples taken over wide areas or to high redshifts 



(e.g. |Eke et al.|2004 |Gerke et al.|20 05 Gladders fc Yee 
20^ |lViillgi^eial,[|j(]05l |Vang ei 3 .111 2005 ; Bcrhnd et al 
2006 ; Hansen et al. 2009]). These studies have established 
many similarities between groups and more massive clus- 
ters, including their extended dark matter halos and el- 
evated fractions of quenched early type galaxies relative 
to the field. Groups show some differences from clusters 
including gas mass fractions that are typically lower in 
less massive systems, and the differences between phys- 
ical processes acting on galaxies in groups and those in 
clusters are still being explored. Recent and ongoing sur- 
veys are pushing to greater sample sizes and higher red- 
shifts with multi-wavelength observations and s pectro- 
scopic campaigns (e. g., Osmo nd fc Ponman 2004;. Drive r 



et al.||2009( [Milkeraitis et ar||2010[ [Adami et al.[|ij010[ ) 
Several large imaging surveys m development plan tc 



Several large imaging surveys m development plan to 
study the growth of structure without significant spectro- 
scopic observations initially (Dark Energy SurvejFl Hy- 
per Suprime-Cairj^ Large Synoptic Survey Telescope^ 
EUCLIEFJ), so photometric redshifts will be important 
for identifying member galaxies using techniques such as 
those outlined in this paper. 

We study groups in the COSMOS field where a unique 
data set has been compiled for studying the interplay 
between galaxies, intragroup gas, and dark matter in 
galaxy groups out to z ~ 1. The COSMOS survey has 
obtained X-ray observations for group detections, deep 
imaging data spanning ultraviolet (UV), optical, and in- 
frared (IR) wavelengths for precise photometric redshifts 
and stellar masses, extensive spectroscopic coverage, and 
high resolution imaging from the Hubble Space Tele- 
scope (HST) for measuring morphologies and weak lens- 
ing. Group catal ogs have been construct ed in this field 
from X-ray data ( Finoguenov et al . || 2007 and in prep.), 
zCOSMOS spectroscopy fKnobel et al."2009'), photomet- 
ric redshifts ( Gillis & Hudson 2011), Cl^'H'i'LS-Deep pho- 
tometry ([Olsen et al. 2007; Grove et al. 2009), and with 



a combination of weak lensing and matched filters (Bel- 



^ 'http : / /www . darkenergvsurvev . org 




^http : //sumire . ipmu. ip/ en 




http : / /www . Isst . org 


— * 


http : //sci . esa. int/euclid 



lagamba et al.||2011[ ). Additionally, |Scoville et al.| ( |2007 1 
studied large-scale s tructures in t his field using photo 



metric redshifts, and Kovac et ah] ( 2010b[ ) measured the 
galaxy density field using zCOSJVioS redshifts to probe 
a large dynamic range of environments. Here we focus 
on the X-ray selected group catalog to ensure a pure 
sample of virialized structures whose masses have been 
characterized with weak lensing ( Leauthaud et al.|2010 ). 
Giodini et al. (2009) have studied the stellar mass con- 
tent of these X-ray groups; we expand upon their work 
with a thorough characterization of a new member selec- 
tion algorithm and develop a group member catalog for 
a variety of applications. 

The data used in making our group catalog are de- 
scribed in § [2j In § [3j we present the sample selection 
and sensitivity limits, along with tests of the quality of 
the photometric redshifts constructed from the imaging 
data. Our selection algorithm for the member catalog 
is described in § H] where we associate member galaxies 
with groups based on their proximity to the X-ray center 
and their photometric redshifts. In § [5| we characterize 
the reliability of our selection with mock catalogs from 
simulations and by comparing our photometric redshift 
selection to the subsample of sources with spectroscopic 
redshifts. We make the catalog of group membership as- 
signments and galaxy properties publicly available, de- 
scribing the format and release in § [6j We discuss in § [T] 
some of our initial findings from the catalog, including 
the infiuence of the group environment on galaxy col- 
ors out to z ~ 1. We find evidence of suppressed star 
formation in galaxies in group environments over the en- 
tire redshift range studied, and briefly discuss how the 
low incidence of star forming galaxies in groups cannot 
play a significant role in explaining recent observations 
of a deficit of power from the S unyaev-Zel'dovich effect 
(SZ; Sunyaev fc Zeldovich|1972 ) in the angular spectrum 



of the cosmic microwave backg round (CMB; e.g., Lueker 
et al.||2010| |Fowler et all2010[ ) 



We adopt a WMAP5 ACDM cosmology to determine 
distances and halo masse s with = 0.258, = 0.742, 
Ho = 72 km s~ ] Mpc~^ (Dunkley et al. 2009), the same 



values used by Leauthaud et al. ( ^2010 ) to calibrate the 
masses of this group sample. f)istances are expressed 
in physical units of Mpc, magnitudes are given on the 
AB system. X-ray luminosities are expressed in the rest- 
frame 0.1-2.4 keV band, and logarithmic quantities use 
base 10. Group masses are estimated from th eir X-ray lu- 
minos i ty usin g the Lx — M relation derived in |Leautliaud| 
et al. (2010) and concentrations ar e then derived from 
the rnass-concentration relation of iZhao et al. (|2009[) 



assumi ng an NFW density profile ( [JNavarro, I''renk, fc 
White] fl996|. We estimate the virial radius of groups 



as K2mci the radius within which the mean density is 
200 times the critical density of the Universe at the red- 
shift of the group, Pc(2g), and use the corresponding halo 

masses defined as Af200c = (200/9c(2G))(47r/3)i?2ooc^- 
We also make use of the NFW scale radius, defined as 
Rs = ^200c/c200c, where C2ooc is the concentration pa- 
rameter. 

2. COSMOS DATA 

The COSMOS field has been observed in a broad range 
of wavelengths, with imaging data from X-ray to ra- 
dio and a large spectroscopic follow-up program (zCOS- 
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MOS] 


with the Very Large Telescope (VLT) (Scoville 


et al. 


2007 Koekemoer et al. 


2007 Lilly et alT 20071. 



We have added to the spectroscopic sample in groups 
with a recent campaign using the Focal Reducer and low 
dispersion Spectrograph 2 (F0RS2) at the VLT (Pro- 
gram ID 084.B-0523; PL Mei). X-ray imaging has been 
taken with the XMM-Newton (1.5 Ms coverin g 2.13 deg^; 



Hasinger et al. 2007 Cappelluti et al. 20091 and Chan- 
dra o bservatories (1.8 Ms covering 0.9 deg^; Elvis et al. 



2009| . Imaging obtained through the F814W hlter oi the 
Wide Field Channel (WFC) of the Advanced Camera for 
Surveys (ACS) on HST adds accurate s hape measure- 
ments for morphologies and weak lensing ( Scarlata et al.^ 
2007 Leauthaud et al.|2007 ). Observations of over thirty 



photometric bands covering the ultraviolet, optical, and 



(]Capak et al. 


2007b 


Ilbert 


Ishitt uncertamty a-p ^ 0.01 



for galaxies with F814W < 22.5 , and ay 
F814W=24, at z < 1.2 (see 



The difference in these two flags reflects the uncertainty 
in the X-ray position; for xflag = 2 the uncertainty in 
each coordinate is assumed equal to the wavelet scale of 
32", and for xflag = 1 sources, which have more certain 
centers, the uncertainty is 32" divided by the significance 
of the flux measurement. The mean uncertainty in right 
ascension and declination for sources with flags 1 and 2 
is 23" (120 kpc at z = 0.4 and 170 kpc at z = 0.8 for our 
adopted cosmology). 

The main chan g es fro m the catalog described in 
Finoguenov et al. (20071 are the detection of fainter 
groups thanks to deeper XMM coverage, a more con- 
servative point-source removal procedure, increased red- 
shift accuracy due to the availability of more spectro- 
scopic data and improved photometric redshifts, and 
some changes in quality flags after visual inspection of 
optical counterparts. In total, the catalog used in this 
paper contains 211 extended X-ray sources over 1.64 
0.03 for deg^, spanning the redshift range < z < 1 and with 



2.3 and 



3.2 for details and 
m this paper). 



tests of the photometric redshifts usee 

2.1. X-ray Catalog 

The entire COSMOS region has been mapped through 
54 overlapping XMM-Newton pointings and additional 
Chandra observations covering the central region (0.9 
deg^) with higher spatial resolution. A mosaic combining 
these two data sets has been used to flnd and measure 
the fluxes of groups using a w avelet transform method 
described in Vikhlinin et al.| (|l998). The data reduc- 
tion process including the combmation of X-ray data sets 
and identiflcation of opt ic al counterparts fol l ows th at of 



iFinogucnov et all (|2009|); iFinoguenov et al.l (|2010|). An 

^ — ■ . ■ I -i — r ' — r ' i't— T-TT-TTm-m-r-T'— i—r • ' ' i 



mitial gr oup catmog f roin th e COSMOS field is presented 



Finoguenov et al.| ( |2007[ ). 
:5riefly, extended objects are detected in the mosaic 
when the sum of the flux on scales of 32" and 64" is 
greater than the flux on small scales by a given threshold. 
Detections on smaller scales tend to be contaminated by 
point sources, which are cleaned from XMM and Chan- 
dra data separately to allow for variability. The flux is 
calculated using a scaling relation that incorporates a /?- 
model fit to the surface brightness within 32", resulting 
in a 4ct detection limit of the group sample of 1.0 x 10~^^ 
erg cm~^ over 96% of the ACS field. Once extended 
X-ray sources are detected, a red sequence finder is em- 
ployed on galaxies with a projected distance less than 
0.5 Mpc from the centers to identify an optical counter- 
part and determine the redshift of the group, which is 
then refined with spectroscopic redshifts when available. 
The red sequence finder only requires an overdensity of 
red galaxies and not a deficiency of blue galaxies, mean- 
ing that it does not specifically require an enhanced red 
fraction to identify groups. 

A quality flag (hereafter xflag) is assigned to the re- 
liability of the optical counterpart, with flags 1 and 2 in- 
dicating a secure association, and higher flags indicating 
potential problems due to projections with other sources 
or bad photometry due to bright stars in the foreground. 
We run our membership algorithm on all detections in 
the catalog with zq < 1 but in later analyses limit the 
sample to groups with xflag = 1 and 2, which have re- 
liable spectroscopically conflrmed optical counterparts. 



a rest-frame 0.1-2.4 keV luminosity range of 41.3 < 
log(Lx/erg s^^) < 44.1, and 165 of these groups and 
clusters have secure optical counterparts with xflag — 1 
or 2. X-ray detections without clear optical counter- 
parts are likely a mix of unresolved active galactic nuclei 
(AGN), projections of multiple systems, and background 
fluctuations; tests of the identification method using a 
larger spectroscopic sample will be presented with the 
updated X-ray catalog in a separate paper (Finoguenov 
et al., in prep.). 

2.2. Spectroscopic Data 

The COSMOS field has been targeted by a number 
of spectroscopic campaigns. We use spectra from the 
zCOSMOS "20K sample" (Lilly et al., in prep.) which 
targeted galaxies in the ACS area to a magnitude limit of 
i+ = 22.5, along with other spe ctroscopic data sets from 
Keck, MMT, SD SS, and VLT ( |Prescott et al][2006l [Ca^ 
pak et alT]|2010 |. We include in this paper a new sarnpl e 



from our recent program with F0RS2 /VLT (see § |2.2.1[ ) 
Each redshift has an associated confidence flag; we use 
only those of class 3 or 4 meaning that the redshift is se- 
cure or very secure. In repeat observations, zCOSMOS 
targets with the se confidence clas ses have a verification 
rate of > 99% ( [Lilly et aLl|2007[ ). For galaxies in the 
ACS field with F814W < 24.2 and any redshift passing 
this quality cut, the spectroscopic sample has 529 galax- 
ies from F0RS2, 11619 from zCOSMOS, and 1527 from 
other sources. These spectra are distributed through- 
out the redshift range used in this paper, with 1931 at 
0.05 < z < 0.25, 4257 at 0.25 < z < 0.50, 4184 at 
0.50 < z < 0.75, and 1980 at 0.75 < z < 1.00. 
A spectroscop ically-selected g r oup c atalog has been 



constructed by Knobel et al. (20091. We defer a 



detailed comparison between the galaxy content of 
spectroscopically-selected groups and X-ray selected 
groups to fu t ure wo rk (Finoguenov et al., in prep.). 
Kovac et al. (2010b) showed that there is good gen- 
eral correspondence between the overdense regions in 
the galaxy density field, the spectroscopically-selected 
groups with N^cm ^ 4, and the X-ray detected groups. 

Our primary use of the spectra is to obtain precise 
group redshifts and to verify the accuracy of photomet- 
ric redshifts, which are critical for both the membership 
selection and the weak lensing analysis. Roughly 20% of 
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group members have spectroscopic redshifts in addition 
to the photometric redshifts used for member selection. 

2.2.1. F0RS2 Spectra 

We have recently obtained additional spectra of galax- 
ies in the COSMOS field with the F0RS2 spectrograph 
at the VLT. Targets were selected for a number of scien- 
tific goals including velocity dispersion measurements of 
massive central galaxies and a comparison sample of field 
ellipticals, a study of merger rates within groups based on 
the abundance of close pairs, and refined redshift deter- 
minations for group members. Data were taken on four 
clear nights with excellent conditions and 0.8" typical 
seeing from 2010 February 14-18. The F0RS2 instru- 
ment was used in MXU mode with the 600RI grism and 
GG435 order separation filter, providing a wavelength 
range of roug hly 4500 - 9000A. There were 27 masks 
each observed in 4 exposures of 650 seconds. Each mask 
had roughly 50 slits of width 0.6" for bright targets and 
1" for fainter ones, and a typical slit length of 8". 

These data have been reduced using the standard ES- 
OREX reduction pipelin^ In short, for each mask and 
detector we performed bias subtraction and overscan re- 
moval, determined a wavelength solution from He, HgCd, 
Ar, and Ne arc lamps, and found slit extraction regions 
using a pattern recognition algorithm on the arc and flat 
lamp exposures. Science exposures were bias subtracted 
and flat fielded before a median combination. The flat 
fields were first normalized by dividing out a smooth 
component calculated using a 10 x 10 pixel median filter 
to account for the intrinsic shape of the fiat lamp spec- 
trum. A local sky subtraction was performed on each 
CCD column prior to rectification, then cosmic rays were 
removed and object spectra were optimally extracted. 

For the extracted objects, we measured redshifts with 
a modified version of the zspec software used for the 
DEEP2 survey (Cooper et al., in prep.). Each one- 
dimensional spectrum was fit by a linear combination 
of galaxy eigenspectra and also compared with stellar 
and quasar templates over a range of redshifts to find 
possible redshift values. Spectral features important for 
fitting in this range of wavelengths and redshifts include 
[O II], CaK, CaH, G-Band, H^, [O III], Mgb, and NaD. 
Each spectrum was visually inspected by at least two 
co-authors alongside the two-dimensional spectral image 
to choose the best redshift and as sign a quality fiag ac- 
cording to the zCOSMOS system ( [Lilly et al.||2007[ ). In 
cases where the first two inspectors disagreed on the red- 
shift or quality fiag, a third person viewed the spectrum 
independently to reconcile differences. We include only 
those objects with a secure redshift (quality flag = 3,4) in 
our sample, which amounts to 529 galaxies. Our redshift 
success rate will improve with continuing reduction ef- 
forts to handle cases where slits were tilted to cover close 
pairs or to measure velocity dispersions along the major 
axis of a galaxy. There are 8 objects in this sample that 
have been observed by SDSS, with a median and scatter 
between redshift measurements of 16 and 43km s~^, re- 
spectively. For 126 objects observed by both F0RS2 and 
zCOSMOS, the scatter in redshifts is 160 km s^^ after 
removing 2 outliers with \Az\ > 0.002. We have detected 



a median offset of ~ 100 km s~^ between zCOSMOS red- 
shifts and those measured by F0RS2 and SDSS which is 
still under investigation, but since the magnitude of this 
offset is a factor of 3 smaller than the typical group veloc- 
ity dispersion and several times smaller than photometric 
redshift errors it should not impact our results. 

2.3. Photometric Redshifts 

Despite the extensive spectroscopic data available, cov- 
erage of group members is incomplete. We instead use 
photometric redshifts ( hereafter pho t o-zs) to determine 



distances to galaxies. Ilbert et al. (2009) constructed 



spectral energy distributions (SEDs) from over 30 bands 



of UV, optical, and IR data described in [Capak et al 
(2007b), and compared these SEDs with templates from 
galaxies at known redshifts supplemented with stellar 
population synthesis models. They computed photo-zs 
from SEDs using a template fitting method which 
included a treatment of emission lines. The derived 
X^(z) function was used to compute a probability den- 
sity function (PDF), 7^(2:), which is the likelihood that 
a galaxy lives at a redshift z given the photometric data 
and the spectral templates used. Rather than collaps- 
ing this function to a single value at the mean, median, 
or peak and assuming Gaussian uncertainty as is often 
done, we make use of the full PDF to determine group 
membership, described in §|4] In this paper we use an up- 
dated version (p dzBay-vl. 7. 010809) of the photo-z cata- 
log presented in Ilbert et al. ([2009| with additional deep 
H band data and small improvements in the template 
fi tting techniqu es 



[Ilbert et al. (2009) demonstrated that these photo- 
zs are precise and accurate thanks to the broad wave- 
length range covered by the photometric data and the 
many bands into which it is divided. Those authors dis- 
cussed the quality of the photo-zs in comparison with a 
number of samples of spectroscopic redshifts using the 
normalized median absolute deviation (NMAD = 1.48 x 
median(lzs - Zpl/(1 -I- Zs)); ^ 
is an estimator for cr^z/(i+^, 
liers 



Ilbert et al. 
offsets between t 



^ http : //www . eso . org/cpl/esorex . html 



Hoaglin et al. 19831, which 
that IS robust to out- 
(|2009) showed that the distribution of 
re photometric and spectroscopic red- 
shifts is well-fit by a Gaussian with a standard devi- 
ation equal to the NMAD. Applying this estimator to 
galaxies considered for group membership, i.e., those 
with F814W < 24.2, Zo< 1.2, and an avaflable steflar 
mass estimate (see § 2.4), there are over 12000 spectro- 
scopic redshifts and the overall agreement with photo-zs 
is U/^z/ii+Zs) = 0.008. However, the spectroscopic sam- 
ple is dominated by the zCOSMOS survey, which has a 
magnitude limit of z+ = 22.5. The other spectroscopic 
samples have a variety of selection functions, so we can- 
not assume that the sample is representative of the full 
photometric sample of galaxies. 

A second measure of uncertainty of a photometr ic red- 
shift comes directly from the width of the PDF. [Ilbert] 
et al. (2009 ) have shown that the shape of V{z) is broadly 
consistent with the distribution of offsets between photo- 
metric and spectroscopic redshifts. For example, 65% of 
objects have a redshif t off set within the 68% uncertainty 
on the PDF, a-p. In § [3.2[ we discuss further tests on the 
agreement between these two estimates of redshift uncer- 
tainty, a-p and ctaz (we henceforth drop the conventional 
factor of 1 -|- Zs to make direct comparisons between the 



6 



George et al. 



two quantities) , and we study variations in photo- 2; qual- 
ity that could bias our selection against different galaxy 
populations. 

2.4. Stellar Mass Estimates 

Stellar mas ses a re used in the identification of group 
centers (see § 4.3) and are estimated using the Bayesian 
code described m Bundy et al. ( 2006|), with good agree- 
ment to the masses determmed by Drory et al. (2009). 
For each galaxy, the SED and photo-z described above 
are referenced to a grid of stellar population synthe- 
sis mo dels constructed using the Bruzual fc Chariot 
(|2003|) code and assuming an initial mass function from 
Chabrier (2003). The grid includes models that vary in 



age, star formation history, dust content, and metallic- 
ity. At each grid point, the probability that the observed 
SED fits the model is calculated, and the corresponding 
stellar mass is stored. By marginalizing over all parame- 
ters in the grid, the stellar mass probability distribution 
is obtained. The median and width of this distribution 
are taken respectively as the stellar mass estimate and 
the uncertainty due to degeneracies and the model pa- 
rameter space. The final stellar mass error estimate also 
includes uncertainties from the K-band photometry and 
the expected error on the luminosity distance that re- 
sults from the photo-z uncertainty, producing a typical 
final uncertainty of 0.2-0.3 dex. Stellar mass estimates 
in this paper require a 3a detection in j^g- band, which 
is complete to a typical depth of Kg = 24 (McCracken 
eFaL][2010l ). 



3. SAMPLE LIMITS AND QUALITY OF PHOTOMETRIC 
REDSHIFTS 

3.1. Mass Limits and Quality Flags 

In this section we present the sample selection and sen- 
sitivity limits for galaxies and groups used in our anal- 
ysis. Since one of our limiting factors is the decline in 
photo-z precision at faint magnitudes, we also discuss 
tests of the accuracy and precision of photo-zs for differ- 
ent galaxy populations to show that our group member 
sample is not biased by variations in the quality of photo- 
zs. 

As mentioned in § |2.1[ we consider X-ray detected 
groups at redshifts < zq < 1- Identifying optical as- 
sociations with X-ray groups becomes more challenging 
at ZQ > 1 and typically requires dedicated spectroscopic 
followup, so we omit high redshift candidates from this 
work. In addition to the flags from the X-ray catalog 
describing the quality of the optical identification and 
centroid uncertainty, we record three additional fiags for 
each group: 

• MASK: more than 10% of the area within i?2ooc or 
within Rs of the X-ray center is masked in optical 
images or falls outside the edges of the ACS field 

• POOR: 3 or fewer member galaxies are associated 
with the group (using Pmcm > 0.5, see § |4]) 

• MERGER: the projected radius (i?200c) drawn from 
the X-ray center of one group overlaps with that of 
another by more than 25% and the group redshifts 
are consistent (|Az| < 0.01). 



We fiag groups in masked regions because their mem- 
bership may not be adequately represented and central 
galaxies may not be properly identified. Poor groups are 
flagged as possibly questionable optical associations or 
redshift determinations, and merging groups are flagged 
because the algorithm may confuse membership assign- 
ments. Of the 165 X-ray groups with a clear optical 
counterpart (xFLAG = 1 or 2), 10, 12, and 15 groups are 
assigned the MASK, POOR, and MERGER flags, respec- 
tively. Our rationale for assigning these flags is to attain 
a group catalog that is as pure as possible, though not 
necessarily complete. 

The left panel of Figure [T] shows the halo masses and 
redshifts for the group sample. Green squares represent 
the cleanest sample of 129 groups with XFLAG = 1 or 2, 
and none of the other flags set, black points relax the 
restrictions on the MASK, POOR, and merger flags, and 
gray dots represent the remaining sources in the catalog 
with higher values of xflag. The red curve shows the 4(T 
X-ray flux hmit reached in 96% of the field of 1.0 x lO'^^ 
erg cm^^ s^^ converted to a limiting group mass. Cov- 
erage is non-uniform, so some groups are detected below 
this threshold in areas with deeper coverage. Blue lines 
show the mass and redshift bins used for later analysis. 

To be considered for group membership and to de- 
rive stellar mass estimates, galaxies must be brighter 
than F814W < 24.2 and have a photo-z in the range 
< Zp < 1.2. Galaxies must also have a 3a Kg- 
band detection, for which the typical limiting depth is 
Ks = 24. Though the photometry in COSMOS is com- 
plete to z"*" = 26.2 and has sim ilar depths in other opti- 
cal filters (Capak et al. 2007b), the ii^s-band detection 
requirement causes detections in the ACS imaging to 
become incomplete near F814W=24.2, which is also in 
the magnitude ran ge where photo-z quality deteriorates 
rapidly (see § |3.2[ ). The F814W filter magnitude cor- 
relates more strongly with photo-z precision than longer 
wavelength filters (the 4000A break enters the filter range 
at z ^ 0.75 and remains in that range beyond our red- 
shift limit), so we use it to apply the formal magnitude 
cut at F814W=24.2. Taking this as our primary mag- 
nitude cut, we find that only 5% of the sample with 
F814W < 24.2 is excluded due to a nondetection in Kg 
or a failure to find an acceptable stellar mass fit, with 2% 
of bright objects (F814W < 22.5) and 8% of faint obects 
(23.5 < F814W < 24.2) being cut. Because of photo-z 
uncertainties, we allow galaxies to have a higher redshift 
limit than the groups in which they reside, giving the 
Zp < 1.2 cut. 

The right panel of Figure [T] shows the stellar masses 
and photometric redshifts for galaxies meeting the se- 
lection criteria. We plot only every third galaxy for 
clarity. The red curve shows the 85% stellar mass com- 
pleteness limit calculated for the oldest allowable stellar 
template at each redshift for the combined requirements 
of Ks < 24 and F814W < 24.2. This passive hmit is 
conservative, as younger stellar populations have lower 
mass-to-light ratios. At z = 1, our stellar mass limit is 
rough ly log(M^/M0) = 10.3, or 0.25M* ( [Drory et al' 
2009| found log(M*/M0) « 10.9 for the massive end of a 



double-Schechter function fit to the stellar mass function 
at z ^ 1, with little redshift evolution). Solid blue lines in 
the figure show the mass and redshift bins for later analy- 
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ses, and dashed lines are drawn for stellar mass bins that 
extend significantly below our completeness limits. Our 
sample has fewer galaxies and groups at z < 0.2 than at 
higher redshifts because of the smaller volume probed, 
but the increasing volume at higher redshifts allows for 
good statistical samples. 

3.2. Tests of Photometric Redshifts 

We have described how the increase in photo-z errors 
at faint magnitudes partially motivates our selection cut 
on galaxies brighter than F814W — 24.2, with the im- 
plicit concern that poorer photo-zs degrade our ability 
to assign galaxies to groups. Here we test how photo- 
z quality varies with other galaxy properties to ensure 
that our selection is not biased by systematic errors for 
certain galaxy populations. 

In principle, photo-z quality can depend on any prop- 
erty of an SED or the templates used in the fitting pro- 
cess. Redshifts of red galaxies with strong 4000A breaks 
have traditionally been easier to constrain than their 
bluer counterparts. Fainter galaxies have larger photo- 
metric uncertainties which propagate into their photo- 
zs. Galaxy mass and environment may play a role if, for 
example, the photo-z templates are not representative 
of evolutionary histories unique to dense group regions. 
Morphology can also have a subtle effect since the in- 
clinat ion of disks alte rs the extinction along the line of 
sight ( |Yip et al.|[20ll| ). 

Motivated by these possible sources of variation in 
photo-z quality, we divide our sample into different pop- 
ulations and quantify the precision and accuracy of their 
redshift estimates. We also compare two estimates of 
photo-z uncertainty, the 68% width of the PDF (a-p), and 
the deviation between photometric and spectroscopic 
redshifts (crAz/(i+Zs))- 

In order to test the reliability of the redshift un- 
certainty for different galaxy populations, we slice the 
galaxy sample into bins based on their brightness, red- 
shift, color, morphology, stellar mass, and environment. 
Here we use unextincted rest frame colors derived from 
the best-fitting templates using the difference between 
absolute magnitudes in near-ultraviolet (NUV) and R 
bands (C = M{NUV) - M{R)) as described by [Ilbert 
et al. (2010). In that paper, spectral classes were identi- 



lier fraction is typically no larger than a few percent 
In particular, the photometric depth in many ban ds and 
the tr e atmen t of emission lines in fitting SEDs by |Ilbert 



et al. 



tied with the following cuts on C from blue to red: 

C < 1.2 "high activity" 
1.2 < C < 3.5 "intermediate activity" 
C > 3.5 "quiescent." 

These classes were found to correlate with visually clas- 
sified morphologies as expected. For these tests, we use 
morphologies determined using the Zurich Estim ator of 
Structural Types (ZEST; |Scarlata et al ||2007[ ) on the 
ACS images. The results are compiled m I'able IT] in 
which we present the size and average magnitude ofeach 
population, along with the two measures of photo-z un- 
certainty and the fraction of sources for which the photo- 
z deviates significantly from the spectroscopic redshift. 

The two independent measures of photo-z uncertainty 
are in good agreement, suggesting that we can safely use 
PDF widths to quantify the precision of a given photo- 
z. Furthermore, we do not see strong trends in photo-z 
quality with galaxy type or environment, and the out- 



(20091 appears to balance the weakening 4000A 
break for bluer galaxies, so that photo-z quality does not 
significantly depend on color. The lack of strong vari- 
ations in photo-z uncertainties, and the agreement be- 
tween the two measures of photo-z uncertainties across 
galaxy types and environments demonstrates the robust- 
ness of these redshifts for differe nt populations . We h ave 
not included the photo-zs from Salvato et al. (2009) for 
AGN due to th eir rarity arid the r easonable accuracy of 
the photo-zs of Ilbert et al. (2009) for these sources, but 
future work focusing on AGJN may benefit from the im- 
proved redshift accuracy. 

While the photo-z accuracy is good across the sam- 
ple, the quality does decrease at fainter magnitudes. We 
account for this effect when selecting member galaxies 
by allowing larger tolerances in redshift space for fainter 
sources. There is also some degradation at higher red- 
shift, but since our sample is not as heavily weighted 
toward high redshifts as it is toward faint magnitudes, 
we do not currently account for the redshift dependence 
of photo-z accuracy when selecting group members. We 
note that Table [l] shows mean magnitudes and photo-z 
errors for objects that also have spectroscopic redshifts; 
these errors are representative of the PDF uncertainties 
for the full galaxy sample in the bright bins, but in the 
faint bins the spectroscopic sample is brighter than the 
full population and photo-z errors are smaller than aver- 
age. 

Figure [2] illustrates how the photo-z PDF uncertainty 
varies with magnitude, redshift, and color for the full 
galaxy sample. The parameter space is divided into bins 
of Azp = 0.01, AF814W = 0.1, and AC = 0.1. For 
bins containing at least 10 galaxies, the half-width of the 
median 68% uncertainty on 'P{z) is computed and plot- 
ted according to the color scale shown. Additionally, we 
plot the redshift, magnitude, and color distributions of 
galaxies to characterize the catalog. Clearly the strongest 
trend in photo-z precision is the decrease in quality at 
faint magnitudes, and there is only a weak dependence 
on redshift and galaxy color. Where the group member 
selection algorithm requires an estimate of redshift un- 
certainty, we consider only the magnitude dependence of 
the photo-z uncertainties, ignoring the smaller variations 
due to color and redshift. 

4. GROUP MEMBERSHIP SELECTION 

4.1. Overview 

This is not a paper about finding galaxy groups; in- 
stead our aim is to associate galaxies with groups that 
have already been identified as extended X-ray sources. 
Our basic strategy is to take th e loc ati ons of groups from 
the X- ray catalog described in § |2.1] and |Finoguenov et al.| 
(2007 and in prep.) and assign galaxies to groups based 
on their positions and redshifts. Previous work on find- 
ing group and cluster members has often included as- 
sumptions about properties such as their red sequence 
content, luminosity function, and radial distribution. 
Because galaxy group populations have not been well- 
characterized in the mass and redshift range probed by 
this data set, we do not apply such filters to select mem- 
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Fig. 1. — Mass limits for groups (left) and galaxies (right) over the redshift range < z < 1. Symbols for groups denote their quality 
flags; green squares have XFLAG=1,2 and mask=poor=merger=0, black filled circles have XFLAG=1,2, and gray open circles are the rest. 
Red curves denote the mass sensitivity corresponding with the X-ray flux limit (left) and the stellar mass limits for a passive galaxy 
(right). Blue boxes show the mass and redshift bins used for analysis in §[?] with dashed boxes denoting stellar mass bins with significant 
incompleteness. Only every third galaxy is plotted for visual clarity. 




0.30 



0.10 



0.03 



0.01 



12 3 4 
C = M(NUV)-M(R) 

Fig. 2. — Photo-2 uncertainties (cr-p) as a function of magnitude and redshift (left), and color (right). Main panels show photo-2 
uncertainty in bins colored according to the scale at right. Margin plots on the left side, bottom left, and bottom right show the magnitude, 
redshift, and color distributions, respectively. Curves are separated by color classification showing all galaxies considered (solid black), 
"high activity" galaxies (blue dot-dashed), "intermediate activity" galaxies (green dashed), and "quiescent" galaxies (red dotted). Dashed 
black lines show the galaxy magnitude and group redshift cuts at F814W=24.2 and zq = 1 for the sample, as well as the divisions between 
color types. Ordinate axes on margin plots should be multiplied by 10'' (left side) and 10^ (bottom) for normalization. We do not see 
strong variations in photo- 2: precision with redshift or color, but there is a significant magnitude-dependence. 



bers, with the hope that we can then measure these prop- 
erties in an unbiased manner. 

Effectively, we are selecting galaxies in a cylinder ori- 
ented along the line of sight around the X-ray position 
and redshift for each group. The radius chosen for this 
cylinder is the estimated i?200c of each group based on 
the total mass derived from the X-ray luminosity versus 
-^200c relatio n for the group sample as determined by 
weak lensing ( Leauthaud et al!||2010 ). The depth of the 
cylinder in redshift space is allowed to vary for each can- 
didate member galaxy according to the typical photo-z 
uncertainty for its apparent magnitude (see Fig. 2]). 

Photometric redshift uncertainties are larger than the 
typical intrinsic span of a galaxy group in redshift 
space. A typical photo-z error of a-p = 0.01 in redshift- 



space corresponds to an uncertainty of roughly 40 Mpc 
in distance along the line of sight, while a halo with 
log(Af2ooc/Af(7) ) = 13.5 has a velocity dispersion of 
AzG w 0.001 ( |Evrard et aT]|2008[ ) or a line of sight dis- 
tance uncertainty of roughly 4 Mpc at z = 0. As a result, 
we must account for contamination of the member sam- 
ple by galaxies at a similar redshift and position that do 
not belong to the group. One option is to subtract a 
mean background density from the number of galaxies 
found near the group. This statistical background sub- 
traction can be extended to other quantities of interest, 
such as the total stellar mass in a group, by measur- 
ing those quantities averaged over regions away from the 
group and subtracting them from the values measured 
at the position of the group. One is left with the mea- 
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TABLE 1 

Photo-2 quality for galaxies with spectroscopic redshifts 



Sample 






(F814W> 


med(2s - 2p) 








A 11 
All 


12370 


0.52 


21.3 


U.OUo 


0.012 


0.013 


1.1 


Bright; low z 


5768 


0.31 


20.7 


0.001 


0.009 


0.011 


1.0 


Bright; high z 


ODiO 


n 71 

U. 1 1 


Z±.U 


U.UUD 


n ni 


n 01 ^1 


1 

±.u 


Faint; low z 


280 


0.27 


23.0 


-0.002 


0.014 


0.016 


3.9 


Faint; high z 




u.ou 




n 007 
u.uu / 


n 097 


n 09A 


9 7 


Bright; blue 


6840 


0.52 


21.3 


0.002 


0.011 


0.013 


1.1 


Bright; green 




44 


21 


U.UUIJ 


n 01 A 


n 01 


1 2 


Bright; red 


^ nriQ 

lyyo 


U.OO 


zU.u 


n nriQ 


U.UlU 


U.Ui i 


U.4 




731 


0.70 


23.1 


0.000 


0.021 


0.021 


3.8 


Faint ; green 


zoy 


U.D'4i 


9*^ 9 


U.Ui 1 


n 09S 


n 097 

u.uz / 


9 7 


Faint; red 


67 


0.92 


23.1 


0.013 


0.031 


0.020 


7.5 


Bright; early type 


2450 


0.53 


20.4 


0.004 


0.011 


0.011 


0.8 


Bright; late type 


7248 


0.49 


21.3 


0.003 


0.012 


0.013 


0.7 


Bright; irregular 


1444 


0.59 


21.3 


0.002 


0.012 


0.012 


2.1 


High stellar mass 


4216 


0.62 


20.8 


0.005 


0.013 


0.012 


1.1 


Low stellar mass 


8154 


0.47 


21.5 


0.002 


0.012 


0.013 


1.2 


Near groups 


961 


0.45 


20.6 


0.002 


0.011 


0.012 


0.9 


Outside groups 


9691 


0.54 


21.4 


0.003 


0.012 


0.013 


1.2 


Clean regions 


10987 


0.53 


21.3 


0.003 


0.012 


0.013 


1.2 


Masked regions 


1439 


0.51 


21.2 


0.003 


0.013 


0.012 


2.5 


MMGCeale 


126 


0.49 


19.3 


0.000 


0.009 


0.010 


0.0 


AGN 


229 


0.56 


20.3 


0.003 


0.015 


0.011 


1.3 



Note. — Brightness bins are divided at F814W=22.5 which is the limiting magnitude 
for zCOSMOS; redshift bins are split at ^ = 0.5; color bins are M{NUV) - M(R) < 1.2 
(blue), 1.2 < M{NUV) - M{R) < 3.5 (green), and M{NUV) - M(R) > 3.5 (red); 
morphologies are categorized by ZEST; stellar masses are separated at log(M*/MQ) = 
10.5; group environments are classified as "near" within iJ200c of an X-ray group center 
and where \zs — zq\/{1 + zq) < 0.005, and "outside" beyond 3-R200c and where |2s — 
2gI/(1 + -^g) > 0.01. Masked regions are areas in the optical images with bright 
foreground stars, satellite trails, or image defects. MMCGgcalo t he m ost massive 
group galaxies within an NEW scale ra dius of the X-ray center (see § |4.3[ l . AGN have 
been identified in Chandra X-ray data l |Elvis et al.|2009} . 

NMAD = 1.48X median[|2s - Zp|] 
^ 1.48X median[|68% uncertainty on photo-2 PDE|] 
Fraction of objects with \zs — Zp\/{1 + z^) > 0.1 



sured aggregate quantities for each group, but not a clear 
list of members and non-members. Another approach is 
to assign each galaxy a membership probability reflect- 
ing the likelihood that it belongs in a group, given some 
information about the relative number of field galaxies 
and group members. One can then determine properties 
of the group by selecting members above a given prob- 
ability threshold, or by weighting members according to 
their probability of being a member. 

We adopt this Bayesian approach to produce a group 
member catalog, which can in turn be used to measure a 
variety of properties about each group without requiring 
a new statistical background subtraction for each quan- 
tity. The selection algorithm thus assigns a probabil- 
ity of membership in a particular group to each galaxy 
given a number of observables: the projected separation 
of the group and galaxy in units of the group radius, the 
redshifts of the galaxy and group along with the typi- 
cal photo-z uncertainty for the magnitude of the galaxy, 
and an estimate of the number density of field galaxies 
relative to group members. Additionally, stellar masses 
are used to select a central galaxy from the membership 
li st, r efining the somewhat uncertain X-ray positions (see 



groups themselves. Our use of photo-z PDFs to asso- 
ciate galaxies to known groups and cluste r s is si milar to 
the method outlined by Brunner & Lubin ( 2000 1 ; we ex- 
tend this method to incorporate varying photo- z errors 
and a prior on the relative fractions of galaxies in groups 
and the field. The approach presented here was designed 
with COSMOS data in mind, but may be applicable to 
other multi-wavelength group and cluster studies, such 
as optical imaging surveys in fields with SZ or X-ray 
data. In § |5l we consider the quality of our resulting 
member catalog and how it could be modified by these 
different data sets. We attempt to keep the discussion 
here general while inserting details specific to the COS- 
MOS data when necessary. To find the center of a group, 
we start with the X-ray centroid and then refine this po- 
sition using the most massive member galaxy near the 
X-ray position, and finally we update the member list 
around the new centr al g alaxy (more details on center- 
ing are presented in 



4.3 



4.3). 



4.2. Algorithm 

In this section, we explain in detail how our selection 
algorithm works. We reiterate that our task is to iden- 
tify galaxies that belong to groups rather than to find 



and Paper II. 

We first consider the field galaxies that can contami- 
nate our selection. The background density of galaxies 
varies with position, redshift, and magnitude. We mea- 
sure the number of galaxies in redshift bins {Az = 0.05) 
and magnitude bins (starting at F814W < 21.0, then us- 
ing a width of 0.8 mag, and ending at 23.4 < F814W < 
24.2). This count excludes the volume within 3i?200c and 
Zq ± ba-p (m) around all groups in the catalog regardless 
of flags, where fh is the mean magnitude of galaxies in 
the bin. The final results are not strongly sensitive to 
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the choice of volume removed around groups. Figure |3] 
shows this field density nF(F814W,z) = dNp/dz/dlT; 
which is similar to the quantity shown in the bottom left 
panel of Figure [2l but split into magnitude bins. Figure [3] 
also shows the field density as computed by summing the 
redshift probability distribution functions of galaxies for 
comparison to the approach of directly counting galax- 
ies in photo-z bins. Despite different sampling intervals 
(Az = 0.01 for the PDFs and 0.05 for bin counting), the 
methods show excellent agreement. 

One can measure the background density locally 
around groups to account for correlated structure or 
globally across the field to increase the statistical sam- 
ple with a larger volume, reducing noise. We have di- 
vided the COSMOS field into four separate quadrants 
to look for variations in np with position and find that 
the values are in reasonable agreement across the field, 
with the density in individual quandrants deviating from 
the mean by typically no more than the Poisson errors. 
When smaller volumes are chosen to estimate the field 
density surrounding groups, the increased Poisson uncer- 
tainty swamps the constraint on locally correlated struc- 
ture. We thus opt to use the entire area to estimate 
the mean density of background galaxies as a function of 
magnitude and redshift. We discuss further the choice of 



this method of estimating the field density in § 5.3 



We next consider candidate member galaxies, con- 
structing a list of those objects in a cylinder with a pro- 
jected distance from the group center less than -R200C and 
a redshift within 3cr-p(m,„ax) of the group redshift zq, 
where mmax is the limiting magnitude F814W=24.2 and 
a-p = 0.035. The number of member candidates can be 
compared with the field density in Figure [3| to estimate 
the fraction of galaxies that are group meinbers. 

For each candidate, we compare the photo-z PDF to 
the expected redshift distributions of group members and 
field galaxies. We assume that each galaxy is either a 
group member (G) or part of the field (F), and assign a 
Bayesian membership probability using the relative sizes 
of the group and field populations as a prior to normal- 
ize the distributions. While the intial Za-p cut uses the 
photo-z value Zp to make a rough selection, here we use 
the full distribution ■p(z) for each galaxy to account for 
secondary peaks or other unusual features in the redshift 
PDF. The probability that a galaxy belongs to a group 
given V{z) can be written as 



P{g^G\V{z)) 



P{V{z)\g e G)P{g e G) 
P{V{z)) 



(1) 



The term P{V{z)\g G G) is the likelihood of measuring 
the particular photo-z PDF for a known group member. 
The prior P{g e G) = Na/iNG + Np) ^l-P{g&F) is 
based on the relative number of group and field galaxies 
in the cylinder, and 



P{V{z))^P{V{z)\g(^G)P{geG) 
P{V{z)\g e F)P{g e F) 



(2) 



is the probability of measuring V{z) for any galaxy in the 
group or field. Each factor in Equation [T] has an implicit 
dependence on magnitude which we omit here and in 
following equations for notational simplicity, but we do 
account for magnitude-dependent variations in Vi^z) and 
in the field and group densities. 



In order to compare the observed V{z) with that ex- 
pected for a group or field galaxy, we must assume a 
distribution of redshifts for each population. Since the 
intrinsic velocity dispersion of groups is smaller than the 
uncertainty in Zp we model the true group redshift dis- 
tribution as a (5-function at zq, which is then convolved 
with a Gaussian of width a-p (to) to account for photo- 
z measurement uncertainty. We have tested the effects 
of modifying the true group redshift distribution to be 
broader than a (5-function to account for intrinsic veloc- 
ity dispersion but found this correction to be negligible. 
The redshift distribution of field galaxies is assumed to be 
uniform near zq and remains unchanged after account- 
ing for photo-z measurement uncertainty. Each of these 
redshift distributions is convolved with the photo-z PDF 
7'(z) (note that jV{z)dz = 1), giving 



P{V{z)\g eG)= V{z)Af{zG,crv)dz 



P{V{z)\geF)-. 



Vizl_ 
w{a-p) 



dz 



(3) 
(4) 



where Af{zG,a-p) is a Gaussian centered on the group 
redshift with width equal to the typical V{z) uncertainty 
for the magnitude of the galaxy considered. The field 
density distribution is normalized so that the integral 
over the redshift range zq ± 3a-p is unity, so the width 
normalization parameter is w((T-p(m)) — 6(7-p{m). We 
have written these convolutions as indefinite integrals, 
but in reality they are discrete sums sampled at the red- 
shift intervals Az = 0.01 and range < z < 6 for which 
V{z) has been calculated. Because V{z) is sampled at 
intervals close to the typical photo-z uncertainty, the dis- 
tribution can effectively become a (5-function, underesti- 
mating the true redshift error which has contributions 
from template uncertainties as well as photometric un- 
certainties. So we first convolve V{z) with a Gaussian of 
width dz = 0.01 to account for these uncertainties and 
avoid sharply peaked PDFs. 

To estimate the prior, P(g S G), we begin by counting 
the number of galaxies in the range ZQ±3a-p(m), measur- 
ing Ntot — Nq + Np . The measurement of the field den- 
sity shown in Figure [3] provides an independent estimate 
oinp, which allows us to calculate an expected number of 
field galaxies in the cylinder, Np = J npdzd^l. For each 
galaxy we linearly interpolate the curve in the relevant 
magnitude bin to the group redshift, and multiply np hy 
the volume searched around the group, 67ri?200c^o'-p(TO), 
to determine Np . This value is subtracted from the mea- 
sured A^tot to determine the expected number of group 
galaxies in the cylinder, Nq- We use the estimated val- 
ues, Np and Nq, to determine P{g G G) and P{g € F), 
and Equation [l] assigns each galaxy a membership prob- 
ability between zero and one. In cases where a group is 
not well-detected in a given magnitude bin (A'tot < -^_f, 
i.e. Nq < 0), galaxies in the bin are flagged and ex- 
cluded from membership analysis. Tests in §j5]show that 
excluding these galaxies does not cause significant incom- 
pleteness in the member selection. 

It is possible for the search cylinders of different groups 
to overlap, either because they reside in neighboring po- 
sitions at the same redshift, or because of projections 
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Fig. 3. — Field density as a function of position, redshift, and magnitude. Different colored curves correspond to quadrants of the 
COSMOS field. The thickness of each curve corresponds to the Poisson uncertainty in each measurement. The dashed black line gives the 
stacked 'P(z) for all galaxies sampled at redshift intervals of O.Of , agreeing nicely with the measurement from counting galaxies in photo-z 
intervals of 0.05. The variation between quadrants is not large, so we use the mean density across the whole field, shown by the green curve 
in each of panel and repeated in the bottom right panel for each magnitude bin. 



along the line of sight within the redshift uncertainties. 
In cases where a galaxy is a candidate member of mul- 
tiple groups, each probability is recorded. A total of 
4631 galaxies are assigned high probabilities of member- 
ship (P„,em = P{g e G\V{z)) > P{g g F\V{z)), i.e. 
-Pmom > 0.5) in a group, and of these members only 163 or 
3.5% are also assigned to a second group. For most appli- 
cations we can restrict our analysis to the highest group 
membership probability for each galaxy without any sig- 
nificant change in results, but recording each probability 
assignment will aid in the study of merging groups. 

4.3. Group Centers 

The robust identification of central galaxies is a chal- 
lenging task, and relevant for a range of applications from 
satellite kinematics to stacked weak lensing to stu dying 
the most massive galaxies (e.g., Skibba et al.|2011 1. Mis- 



centering is a significant source ot systematic uncertainty 



tical groups (e.g., 


Johnston et al.||2007 


Rozo et al.||2011 


Rykoff et al.| 201] 


p. X-ray data and weak lensing otter 



los, which we use along with the galaxy content to guide 
our selection. We outline our approach to determining 
the optimal tracer of the center of mass here, and present 
our results in further detail in Paper II. 

We use the X-ray position as an initial approximation 
of a group's center, but for these faint detections the po- 
sition can be uncertain by up to the wavelet detection 



scale of 32" (^ 200 kpc at z = 0.5), so we consider other 
data to improve upon these constraints on the centers. 
Briefiy, we have defined multiple candidate centers based 
on luminosity, stellar mass, and proximity to the X-ray 
center. By measuring the weak gravitational lensing sig- 
nal stacked around each of these positions we can find 
the optimal center which maximizes the lensing signal 
at small radii. Our results indicate that this optimum 
center is the member galaxy (i.e., Pmom > 0.5) with the 
highest stellar mass within the scale radius plus the X-ray 
positional uncertainty of the X-ray center. We refer to 
this object as the MMGGscaio, for Most Massive Group 
Galaxy within the scale radius. We assign this galaxy 
to be the group center, and rerun the algorithm above 
to find members within i?200c of this galaxy for the final 
catalog. 

Traditional visual selection of group and cluster centers 
includes looking for a bright, usually early type galaxy 
near the center of the X-ray or optical distribution, per- 
haps with an extended stellar envelope. Visual inspec- 
tions of the Subaru, ACS, and XMM data support our 
objective selection, with broad agreement between the 
MMGGscaic and the objects one would traditionally iden- 
tify as central galaxies. Visual selection becomes more 
ambiguous at high redshift and for groups lacking domi- 
nant galaxies, while our selection algorithm makes an ob- 
jective choice. In a few percent of cases the MMGGscaic 
disagrees with a visually identified central galaxy due to 
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photo-z error or because of a significant offset from the 
X-ray position putting it outside tlie scale radius. We 
do not amend these cases, sacrificing a small degree of 
accuracy for a uniform and objective selection. 
The selec tion of group centers used here is differ- 



ent than in Leauthaud et al. (20101, which employed a 



weighting based on stellar mass and distance to the X 
ray position. Of the gro ups that have a confiden t central 
galaxy assignment from Leauthaud et al. ( 201_0|) and also 
satisfy the quality cuts for clean groups in § [5} 80% are 
assigned the same central gala xy by the two methods 



9% of the centrals identified by Leauthaud et al. (20101 



are too distant from the X-ray center tor our method to 
select, and 4% are not identified as members with the 
current alg orithm. In these cases of disagreement, the 



selection of Leauthaud et al. (20101 tends to favor more 
massive galaxies that are farther from the X-ray centroid 
than the selection used here, with average differences of 
0.2 dex in stellar mass and 55 kpc in distance to the 
X-ray centroid. 

5. PURITY AND COMPLETENESS 

Any selection of group members will have some fraction 
of false positives, interlopers selected as members that do 
not belong to a group, and false negatives, true member 
galaxies missed by the selection. To measure properties 
of member galaxies, we can weight each galaxy by its 
membership probability to account for these uncertain- 
ties. But we must test the reliability of those membership 
probabilities, and furthermore, for some applications we 
wish to define a set of galaxies exceeding a membership 
probability threshold with a reasonable degree of purity 
and completeness. 

Purity and completeness are measures of overlap be- 
tween the sample of selected members and the popula- 
tion of true members. We define the purity of the sample, 
p, to be the fraction of selected members which are also 
true members. The completeness of the sample, c, is the 
fraction of true members which are selected. Interlopers 
are objects which are selected but are not true members 
and missed galaxies are objects which are not selected 
but are true members. Formally, 



selected 



interlopers 



selected 



iVt, 



(5) 
(6) 



A^intoriopors which Can be 



We can use the values of p and c to estimate A'truo using 

Atruc Agclcctcd Ajnisscd 

rearranged into 

-'^ selected ^ 

using Equations [5] and [6j This correction factor, p/c, can 
be used to remove bias in the estimate of the intrinsic 
number of group members, A'truo, if we understand the 
purity and completeness of the selection algorithm. 

To measure the purity and completeness of our mem- 
ber selection, we must have some way of telling which 
galaxies truly belong to groups. For our application, we 
use the subsample of objects with spectroscopic redshifts 
as well as mock catalogs to obtain knowledge of group 
membership that is independent of our photo- 2: selec- 
tion. The galaxies with spectroscopic redshifts allow us 



to test the photo-z selection method on the same cata- 
log, directly probing the effect of photo-z uncertainties. 
But constraints on p and c are limited by the sparse- 
ness of spectroscopic coverage, and biases could be intro- 
duced since spectroscopic coverage is not representative 
of the full range of galaxies in the group sample. Fur- 
thermore, even spectroscopic selection of group m embers 
can have co ntamination and incompleteness (e.g., Gerke 
eraL][2005| . 

We perform further diagnostic tests using mock cata- 
logs from N-body simulations described in § |5.2| Mock 
galaxies are prescribed to occupy halos according to a 
halo occupation distribution (HOD) model constrained 
by measurements of clu stering, lensing, and stellar mass 
functions in COSMOS ( [Leauthaud et al.||2011a"lbl ). Af- 
ter running the selection algorithm on a mock catalog, 
we can estimate its purity and completeness by compar- 
ing the results with the input list of group members. 
The mocks allow us to study greater volumes than the 
observed region, increasing statistical precision and pro- 
viding estimates of the effects of sample variance for 
the volume probed. Mocks also give direct knowledge 
of galaxy group membership in real space without the 
redshift space distortions that mar spectroscopic selec- 
tion, so we can study how the selection algorithm would 
perform on data sets with different errors in redshifts 
or positions. However, caution must be taken to ensure 
that the mock galaxies adequately represent the reality 
of correlated structure for all relevant properties, partic- 
ularly in their distribution of positions, masses, and halo 
occupation. 

In the following sections we describe in more detail our 
diagnostic tests on the selection algorithm using spectro- 
scopic redshifts and mock catalogs. We begin by testing 
the tradeoff between purity and completeness for differ- 
ent membership probability thresholds, and proceed to 
study the principle sources of contamination and incom- 
pleteness in our selection. 

5.1. Spectroscopic Tests 

Here we consider the subset of galaxies with spec- 
troscopic redshifts to measure the purity and complete- 
ness of the selection and study the effect of photo-z er- 
rors. A "true" member in this case is defined to be 
a galaxy within i?200c of the X-ray center and with 
c|zs — ^g| < 2crv(Af200c, z){1 + zg), where c is the speed of 
light and crv(M2ooc , z) is the velocity dispersion from the 
simulations of Evrard et al. (2008), assuming that the 
velocity bias between galaxies and dark matter is unity. 

As we vary the membership probability threshold for 
photo- 2: selection, we can see a tradeoff between purity 
and completeness shown in Figure H] with black points 
from the spectroscopic test. Error bars show the stan- 
dard deviation of 1000 bootstrap samples of the spectro- 
scopic catalog. Restricting the member list to sources 
with membership probability Pmem > 0.9 gives a purity 
and completeness of 80% and 19% respectively. Lowering 
the membership threshold increases completeness while 
decreasing purity. In later sections, we use a threshold 
of -Pmem > 0.5 as a compromise between these compet- 
ing factors, which for the spectroscopic test produces a 
purity of 69% and a completeness of 92%. 

To further study the quality of the membership selec- 
tion, we can measure trends in purity and completeness 
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against other properties, seen in Figure [5] In this fig- 
ure, we show how the selection performs lor galaxies of 
different redshift, magnitude, stellar mass, group-centric 
distance, and group halo mass, by measuring the purity 
and completeness of objects assigned P,„cm > 0.5. Fig- 
ure[6]shows the same tests for color and morp hology. The 
results are discussed in more detail in § |5.3[ but we can 
see that the selection quality does not vary significantly 
with redshift or group mass, but does degrade in the out- 
skirts of groups and for faint, low-mass galaxies, which 
also tend to have blue colors and late type morphologies. 
We have tested the influence of target selection effects 
on these results by restricting the spectroscopic sample 
to zCOSMOS galaxies which were uniformly selected at 
< 22.5. The purity and completeness measurements 
are consistent within the error bars of the full sample, 
but have slightly larger uncertainties due to the smaller 
sample size. 



3 

CL 
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Fig. 4. — Purity and completeness for different membership prob- 
ability thresholds (Pmcm > {0.1,0.3,0.5,0.7,0.9}) as measured by 
the spectroscopic subsample (black crosses) and mock catalogs 
(blue circles, mean of ten lightcones). Error bars are the stan- 
dard deviation from 1000 bootstrap samples of the spectroscopic 
catalog. The cyan shaded band is the region spanned by the ten 
mock lightcones. 



5.2. Mock Catalogs 

We use numerical simulations to construct a series of 
mock catalogs for a GOSMOS-like survey to test the reli- 
ability of our member selection. Mocks are created from 
a single simulation (named "Consuelo"), part of the Las 
Damas suite (McBride et al., in prep.jj^ Consuelo is 
a box of 420 h^^ Mpc on a side with 1400"^ particles 
of mass 1.87 x 10^ M© and a softening length of 
8 h^^ kpcQ This simulation can robustly resolve halos 
with masses above ^ 10^^h~^ Mq which corresponds to 
central galaxy stellar masses of ~ 10^'^ M©, well- 
matched to our completeness limit of F814W=24.2 at 
z = 0.2 (see Figure [T|. 

We extract ten light cones from the Consuelo simula- 
tion that have the same area as COSMOS and individ- 
ually non-overlapping volumes. Halos within the simu- 



lation are identified with a friends-of-friends (FOF) halo 
finder ( Davis et al.[1985 ) with a linking length of 6 = 0.2. 
For typical halos in the mass range we consider, FOF 
masses and spherical overdensity masses (defined within 
a radius where the mean density is 200 ti mes the back- 
groun d) typically agree within ^ 10 — 20% ( Tinker et al. 
2008[ ); we thus only convert from background to criti- 
cal overdensity to obtain M2ooc- Hal os are populated 
with gala xies using the HOD model of [Leauthaud et aT 



( 2011a|b ) that simultaneously fits the stellar mass tunc 
tions, galaxy clustering, and galaxy-galaxy lensing sig- 
na ls of COSMOS. We adop t the z - 0.6 HOD model 



(2011b) with the following param- 
ot that paper: log(Afi) = 12.725, 
B = 0.466, S = 0.61, 7 = 1.95, 
= 1.65, Bsat = 9.04, ^eut = 0.59, 
1. Details regarding the parame- 



of Leauthaud et al. 
eters from I'able 5 
log(M,,o) = 11.038, , 
fiogM, — 0.249, Scut 

/3sat = 0.740, asat = , _ _ 

ters in this HOD mod el can be found inlLeauthaud et al.l 
(2011a). As shown in Leauthaud et al. (2011b), there is 
a small amount of redshift evolution in this parameter 
set from z ^ 0.2 to z ^ 1. However, the redshift evolu- 
tion should not have a large impact on our assessment 
of the completeness and purity of the group membership 
selection and so we neglect the redshift evolution of the 
HOD in this work. 

Galaxies are assigned cosmological redshifts as well as 
mock spectroscopic redshifts which include the effect of 
peculiar velocities from the velocity dispersion within ha- 
los. Photometric redshifts are drawn from a Gaussian 
distribution centered around the spectroscopic redshift 
with width equal to the photo-z uncertainty for that 
magnitude. A Gaussian P{z) is then centered at Zp with 
the same width, and sampled at the same redshift in- 
terval as the PDF for real galaxies. We do not include 
catastrophic photo-z errors which are shown in Table [l] 
to be a small fraction of the sample. 



(2011a 



^ Details regarding this simulation can be found at [http : //Iss . | 
|phy . vanderbllt . edu/lasdamas/slmulatlons .html' 

~ We use Hq = 100 h km s^^ Mpc^^ in this paragraph only. 



The HOD model of Leauthaud et al. 
stellar masses to mock galaxies but does not assign mag- 
nitudes or colors. In order to apply a similar magnitude 
cut to the mock galaxies as used in the selection algo- 
rithm, we assign F814W magnitudes to mock galaxies. 
For each mock galaxy, we construct a galaxy sample from 
the COSMOS data that is matched in redshift and stellar 
mass in bins of Az = 0.02 and A log(M,^/MQ) = 0.2. An 
F814W magnitude is assigned to each mock galaxy by 
randomly drawing a magnitude from the matched sam- 
ple. We do not assign colors or morphologies to mock 
galaxies since the dependence of these properties on red- 
shift and environment are not well-constrained. We will 
rely on our spectroscopic sample in order to determine 
the completeness and purity of the group membership 
selection as a function of color and morphology instead 
of using mock catalogs. 

Mock halos are given the redshift of the central galaxy 
and X-ray luminosities according to the mean Lx — M2ooc 
relation of Leauthaud et al. (2010). To mimic the posi- 
tion uncertainties ot the X-ray detections, XFLAG quality 
flags 1 or 2 are assigned randomly in proportion to their 
appearance in the COSMOS group catalog. The nom- 
inal group center is offset from the central galaxy with 
a Gaussian scatter of 32" for XFLAG = 2 halos which is 
reduced by the measured flux significance for XFLAG — 1 
halos, and we assume a typical 5a flux measure ment. 
The impact of centroiding errors is investigated in § |5.4 



14 



George et al. 



1.0 



^ 0.6 



0.4 
1.0 

0.8 

0.6 

0.4 



Q. 





71— r-i— I— i-j-T— I— I— I— I— i—i— I— I— I— n 


:— 1 1 1 1 1 1 1 1 1 1 1 1 1— : 


i 





1 

H Mocks 
i Spectra 


"I 










■ 1 ■ ' ■ ' 1 ' ■ ■ ■ 1 ' ■ ' ■ 1 1 


. ' 1 ' i i ' 1 ' i i ' 1 ' . 


.1 i i i 1 i i i 1 i i i 1 i i i 1 i i i 1 
l\ 


. ' 1 i i i ' 1 ' i i ' 1 ' i i '■ 

I , 


I 


















\l 




— Mocks 








\l 




I Spectra 











0.0 0.2 0.4 0.6 0.8 1.0 18 20 22 24 9 



10 



11 0.0 0.2 0.4 0.6 0.8 1.013.0 13.5 14.0 



F814W magnitude Stellar Mass [logCM^Mg)] 



R/R„ 



Halo Mass ^oq(U^f,aJUt^\ 



Fig. 5. — Completeness (top row) and purity (bottom row) of the galaxy membership selection as measured by the spectroscopic subsample 
(points with error bars) and mock catalogs (shaded bands) for galaxies with Pmcm > 0.5. Error bars are the standard deviation from 1000 
l)ootstrap samples of the spectroscopic catalog, and shaded bands show the range spanned by the ten mock lightcones, while the solid black 
curve represents the mock mean. Bins were chosen to measure a roughly constant number of galaxies for each property tested while still 
representing the range of observed properties. 

Next we run the membership algorithm described in 
§ |4] on the mock galaxy and halo catalogs, associating 
galaxies with halos. We can perform the same purity 
and completeness tests as with the spectroscopic sam- 
ple above, but this time we know the halo membership 
a 'priori. The results from these mock catalog tests are 
presented alongside those for the spectroscopic subsam- 
ple as colored bands in Figures |4] and [5] 
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Fig. 6. — Completeness (top row) and purity (bottom row) of 
the galaxy membership selection as measured by the spectroscopic 
subsample for galaxies with Pmcm > 0.5. Error bars are the stan- 
dard deviation from 1000 bootstrap samples of the spec trosco pic 
catalog. M orphological classes are defined from ZEST (Scarlata 
|et al.||2007^ ; the early type category includes ellipticals (type=T) 
and bulge-dominated disks (type=2.0), the late type category in- 
cludes the remaining type=2 sources, and irregulars have type=3. 



5.3. Sources of Error 

Results from the tests on spectroscopic data and mock 
catalogs above can differ because the spectroscopic sam- 
ple is weighted toward bright objects and because our 
knowledge of true membership in the spectroscopic data 
is limited by redshift-space distortions, while member- 
ship in the mock catalogs is known by design. The gen- 
eral agreement seen in Figures |4] and [S] between these 
tests of membership quality is encouraging, and it sug- 
gests that the biases are modest and that the mock cat- 
alogs accurately represent the properties of real galax- 
ies that we wish to study. The normalization of the 
purity and completeness curves for the spectroscopic 
test has a degree of freedom in the velocity width used 
to determine whether a spectroscopic redshift is con- 
sistent with a group redshift. We used the criterion 
c\zs — zq\ < 2crv(M, z)(l -I- zq) for spectroscopic mem- 
bership; a broader velocity range for the spectroscopic 
test would result in a higher measured level of purity 
and lower completeness in the photo-z selection, and the 
converse holds for a smaller velocity range, shifting the 
curves up or down. Though the absolute measure of pu- 
rity and completeness in the spectroscopic tests holds 
some degree of arbitrariness, the relative trends shown 
in Figure [5] are in general agreement with the mocks. 



Galaxies in X-ray Groups 



15 



with some offsets likely due to sampling bias and redshift- 
space distortions. We study the effects of redshift-space 
distortions on member selection in the limit of a com- 
pletely spectroscopic survey in § |5.4[ 

Information from the spectroscopic tests has the ad- 
vantage that it can probe member selection effects due 
to properties that cannot easily be modeled (e.g., galaxy 
color and morphology), and these tests directly measure 
the effects of photo- 2; errors on our selection of galax- 
ies in the same set of groups. We see in Figure |6| that 
the trends of selection quality with color and morphol- 
ogy parallel the trends with magnitude and stellar mass 
from Figure [5j We have shown in § |3.2| that photo-z 
quality is not strongly affected by color or morphology, 
and no other inputs to our selection algorithm explicitly 
depend on these properties. We infer that the lower com- 
pleteness and purity seen for faint, low mass, blue, and 
late type galaxies is driven by two effects; fainter galax- 
ies have larger photo-z uncertainties, and galaxies in this 
population tend to live outside of dense groups so that 
they are more likely to be contaminants when selected. 

Because only a fraction of objects have spectroscopic 
redshifts, the uncertainties can be large. Tests with mock 
catalogs alleviate this issue and provide an estimate of 
the sample variance in our selection due to the finite size 
of the COSMOS region. An additional advantage of the 
mocks is that the central galaxy of each halo is known, 
so we can test the success rate for identifying these ob- 
jects. We find that 77% of central galaxies are correctly 
identified as the MMGGgcaic galaxies in the correspond- 
ing halos, 12% are misidentified as satellites because the 
central galaxy is not the most massive member near the 
centroid, 5% are misidentified as satellites because the 
assigned centroid error puts the galaxy outside of the 
search region, and only 5% are assigned to neighboring 
groups or the field due to photo- 2; errors. While the HOD 
used to create the mocks allows for satellite galaxies to 
be more massive than centrals due to scatter in the rela- 
tion between stellar mass and halo mass, the fraction of 
groups where this occurs is sensitive to the parametriza- 
tion of the HOD model and is not well-constrained. The 
problem of identifying group centers will be discussed in 
more detail in Paper II. 

For the full sample of mock galaxies with Pmom > 0.5, 
we find a mean purity of 67% and completeness of 92%. 
Looking at Figure [5l it is clear that the dominant source 
of impurity comes irom galaxies in projection near the 
outskirts of groups. We can attribute this contamination 
to the fact that the density of true members falls steeply 
as a function of distance from group centers while our 
membership algorithm selects galaxies uniformly out to 
^200c- Faint galaxies are another source of impurity since 
their photo-z errors are larger than average. Galaxies 
with lower masses and bluer colors are more common in 
the field than in dense environments (see §[7|, so a higher 
contamination fraction from these populations is to be 
expected. There is also a slight dependence on halo mass, 
since the density contrast between the field and groups is 
smaller for low mass halos, lowering the assigned mem- 
bership probabilities of candidate members and reducing 
the completeness of the selection. These factors motivate 
the use of matched filters in finding groups and clusters 
when the properties of their galaxy populations are well- 
characterized; we have not employed such filters to avoid 



biasing our sample and because galaxy properties in this 
range of halo masses and redshifts are not thoroughly 
constrained. 

The covariance between these galaxy properties makes 
it challenging to isolate their influence on the contam- 
ination fraction. For example, the correlation between 
the stellar mass and brightness of a galaxy means that 
the corresponding panels of Figure [5] are related and not 
independent probes of contamination sources. The sim- 
plest way to increase the purity of the group sample is 
to consider only galaxies at smaller distances from the 
group center than the cut of i?2ooc used here. Restrict- 
ing the mock sample to i? < 0.5i?200c results in a mean 
purity and completeness of 84% and 92%, respectively. 

An alternative way to address the contamination and 
incompleteness of the selection would be to apply cor- 
rection factors to the member selection as a function of 
these properties, as in Equation[7j This would amount to 
introducing strong priors to the membership algorithm 
based on our HOD model, limiting the independence of 
the sample. In testing this approach however, we have 
noticed that the correction factor as a function of group- 
centric radius is not significantly tied to other properties 
such as magnitude, stellar mass, or color, indicating that 
the contamination is due more to geometry than distinct 
populations of galaxies. This suggests that we can reli- 
ably study the relative radial trends of these properties, 
though the absolute radial trends are subject to uncer- 
tainties in the correction factor. 

We c an compare the mem ber selection used here with 
that of Giodini et al. ( 2009 1 , who used a statistical back- 
ground suEtraHioiTonTEie same body of data to deter- 
mine galaxy membership and estimate the total stellar 
mass in groups. Because the statistical background ap- 
proach does not individually assign galaxies to groups, we 
cannot directly compute the purity and completeness of 
the selection, but we can compare the total stellar mass 
estimat es from the two selection methods to the mock 
values. [Gio dini et al.| (|2009|) selected candidate mem- 
ifithin 



et al.| ( |2009[ ) 

bers within a projected radius i?5ooc of X-ray centroids 
and 0.02 x (1 -|- z) of the group redshift, and estimated 
a mean foreground/background contribution in 20 non- 
overlapping field regions of the same size and redshift. 

We run both member selection methods on the mocks, 
ap plying to each method the same corrections described 



by Giodini et al. (20091 to deproject the cylindrical 
search volume into a sphere of radius i?5ooc and to ac- 
count for stellar mass contributions below our sensitivity 
limit, adapted to the stellar mass function and limits of 
our mocks. The mean stellar mass content in groups re- 
covered using their method is 3% lower (3% higher) than 
the input mock value in the redshift range 0.2 < z < 0.5 
(0.5 < z < 1.0). With the same corrections, our selec- 
tion method estimates the mean stellar mass to be 3% 
higher (9% higher) than the mock values over the same 
redshift intervals. The typical scatter of 35% between the 
recovered values and the input values for a given group is 
much larger than the offsets for both methods, but with 
these tests on mock catalogs we could remove the small 
biases in future measurements. The mean stellar masses 
inferred by the two methods happen to be quite similar 
because they are typically dominated by massive galaxies 
for which membership assignment is relatively straight- 
forward. However, we note that the full membership 
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selected can be quite different because our approach op- 
timizes group centers using the weak lensing signal and 
handles magnitude- dependent pho t ometr ic redshift un- 
certainties, whereas Giodini et al. (2009) use the X-ray 
centers and a fixed redshitt window. 

We can also test different methods of estimating the 
field density to see how it influences our member selec- 
tion. Our selection algorithm estimates Np from the 
mean density across the whole field, but smaller re- 
gions could instead be used to estimate the local density 
around individual groups. While the local density esti- 
mate has the advantage that it traces correlated struc- 
ture around groups, it does suffer from greater shot noise 
than the density estimated over a larger volume. We 
have tested our approach by using annuli centered on 
each group with inner and outer radii of 2i?2ooc and 
5i?200c, while keeping the rest of the selection algorithm 
the same. With this approach, the typical field density 
is higher due to clustering around groups and the result- 
ing membership probability is slightly lower (increasing 
the field density by a factor of 2 typically lowers the 
membership probability by only ^ 20%), but the pu- 
rity and completeness of the sample are essentially un- 
changed, and the fraction of members crossing a thresh- 
old of Pmem > 0.5 between samples is less than 10%. 
We obtain similar results when s ubstituting the back- 
ground estimation method used by Giodini et al. ( 2009 1 
for our field density prior, so the selection algorithm is 
not strongly sensitive to the approach used for back- 
ground estimation. 

5.4. Applicability to Other Surveys 

In view of other surveys which will search for groups 
and clusters in multi-wavelength data, and to better 
characterize the advantages or shortcomings of the COS- 
MOS data used in this analysis, we test our selection 
algorithm on mock catalogs with different levels of un- 
certainty in redshift and centroid measurements. We 
consider five hypothetical data sets: a full spectroscopic 
survey where all galaxies have the typical redshift uncer- 



tainty in zCOSMO; 
spectroscopic survey 



oi (Jz — 3.7 X 10 ^, a low-resol ution 
ike PRIMUS JCoilet al.|20To| with 
redshift uncertainties of cr^ = 0.005, a photometric sur- 
vey with fewer band s and large r photo-z error s like SDSS 



(Csabai et al. 2003) or DES (Banerji et al. 2008) with 



(Tz = 0.05, a deeper X-ray survey with more precise cen 
troids of 3", and a lower resolution X-ray or SZ survey 
with centroid uncertainties of 1'. In the first three mock 
surveys we vary only the redshift uncertainty and ap- 
ply the same centering uncertainty as the fiducial COS- 
MOS mocks described in § |5.2| assuming similar X-ray 
detections. In the final two mock surveys we use the 
magnitude-dependent redshift uncertainties of the COS- 
MOS mocks and assign centroiding uncertainties, ax, in 
each dimension on the sky. We offset the nominal cen- 
troid from the central galaxy in each dimension by a 
random value drawn from a Gaussian of width ax- In 
all cases we keep the same group and galaxy detection 
limits as in the COSMOS data. 

Figures [7] and |8] show the purity and completeness ob- 
tained when applying our member selection algorithm to 



these mock surveys, in a manner similar to Figure [5] We 
reiterate that these statistics describe the accuracy of the 
assignment of galaxies to groups, and not the detection 
of groups themselves. The figure illustrates that purity 
and completeness improve as redshift and centroid uncer- 
tainties decline. A number of other points can be made 
about these results: 

• Deeper and more complete spectroscopic coverage 
would improve our member selection, increasing 
the purity of the sample from ~ 70% with photo-zs 
to ~ 85%. Improvements for completeness would 
mainly be gained from faint galaxies near our mag- 
nitude limit. 

• Among spectroscopic redshifts, high precision is 
not critical. The completeness of the CTz = 
3.7 X 10~^ and 0.005 samples are nearly identi- 
cal, and the higher precision spectra provide only 
a modest improvement in sample purity over the 
low-resolution spectra, from ^ 80% to ~ 85%. 
Once the redshift measurement uncertainty be- 
comes comparable to the magnitude of intrin- 
sic redshift distortions due to peculiar velocities 
in groups, additional spectral resolution does not 
greatly improve our ability to identify members. 
PRIMUS data in the COSMOS field wiU improve 
upon the existing sampfing of zCOSMOS, but we 
note that the completeness limit for that survey is 
i — 22.5 with sparse sampling to i = 23.5, still 
shallower than our photo-z depth of F814W=24.2. 

• The precise photometric redshifts available in the 
COSMOS field are critical for identifying members 
using our approach. Redshifts that are less accu- 
rate or precise show significantly reduced purity 
and completeness. 

• The precision of X-ray centroids for COSMOS 
groups is quite sufficient for member selection. Im- 
proving the positional uncertainty by roughly a fac- 
tor of eight from the mean COSMOS value results 
in only a few percent improvement in completeness 
and a negligible gain in purity. Conversely, less pre- 
cise centers (such as those available from SZ mea- 
surements) produce a sample with lower purity in 
the central region (^ 85% instead of ^ 95%) and 
significantly lower completeness (~ 75% instead 
of - 90%). Though the existing COSMOS cen- 
troids are adequate for assigning member galaxies 
to groups, we note that several aspects of groups 
could still be studied with deeper X-ray or SZ data 
including physical offsets between central galaxies 
and hot gas, and the relationships between temper- 
ature or entropy with other group properties. 

• Note that we do not optimize our selection algo- 
rithm for these hypothetical data sets. Combin- 
ing catalogs built fr om different observables (e.g., 
Cohn fc White|2009 ), and other techniques such as 



http : //archive . eso . org/archive/adp/zCOSMOS/VIMOS_ 
|spectroscopy _vl .0/ index. html 



iterative centering or matched filters could improve 
results. 

We can compare the results of our mock spectroscopic 
selection to other methods in the literature. We must 
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note that our definitions of purity and completeness re- 
fer to the success rates for assigning members to known 
groups, while previous spectroscopic group-finding ef- 
forts have typically quantified the purity and complete- 
ness of the identified group catalog in addition to the 
galaxy membership assignment. In our mock tests, we 
have implicitly assumed that the identification of groups 
is pure and complete. It is also difficult to make direct 
comparisons across surveys because of differences in data 
sets, limiting depths, and mock catalogs. However, look- 
ing briefly at the quoted purity and completeness of spec- 
troscopic group catalogs, we can assess our algorithm and 
see the advantage to assigning group membership when 
the existence of a group is already know (e.g., from an 
X-ray detection). 

Using a tesselation method to find galaxy groups in 
DEEP 2 with a limiting galaxy magnitude of Rab = 24.1, 
Gerke et al. ( [2005 |) attained a mean interloper fraction 
(analogous to our impurity, 1 — p) of // = 0.458 ± 0.004 
and a mean galaxy success rate (analogous to our com- 
pleteness) of S'gai = 0.786 ± 0.006 in their mock tests. 
They reported a one-way group identification purity of 
Pi = 0.545 ± 0.005 and c ompleteness of Ci = 0.782 ± 
0.006. Knobel et al. ( 2009 1 found that a friends-of-friends 
approach performed better than the tesselation method 
for identifying groups in zCOSMOS to a limiting mag- 
nitude of Iab = 22.5, and reported values of // = 0.29, 
^gai = 0.84, Pi = 0.66, and Ci = 0.81 from their mocks. 
These values are for groups with N^cm > 2, and while 



Gerke et al. ( [2005^ showed values for group purity and 
completeness tha t were roughly const ant with group ve- 
locity dispersion, Knobel et al. ( 2009 ) showed that each 
of these statistics improved when restricting the sample 
to higher richness groups, flattening out for groups with 
^mcm ^ 5. Disentangling the effects of group identifi- 
cation from galaxy membership assignment is difficult, 
but since the reported S'gai /Ci and (1 — fi)/Pi are both 
approximately unity, it appears that the main challenge 
in assigning galaxy membership with these algorithms is 
in identifying real groups. This fact illustrates the ad- 
vantage of combining group-finding methods to ensure a 
reliable sample of groups before assigning members. 

6. MEMBER CATALOG 

In the spirit of public releases of COSMOS data, we 
make our membership assignments available as machine- 
readable files through the NASA/IPAC Infrared Science 
Archive (IRSA^ These data include galaxy positions, 
redshifts, stellar masses, colors, and membership proba- 
bilities, along with group identifications. For each group 
we provide the X-ray position, flux, and luminosity, along 
with the rcdshift, halo mass, quality flags, and the posi- 
tion and stellar mass of the central galaxy MMGGscaic- 
For reference, the basic parameters describing the cat- 
alog are compiled in Table [2j For analyses requiring a 
clean selection of galaxy groups, we restrict the sample 
to groups with xflag = 1 or 2 and the mask, POOR, and 
MERGER flags blank to ensure that groups and members 
have been reliably identifled; in the group catalog we de- 
fine a new property, flag.include, to encode this com- 
bination of selection cuts. When a pure and complete 



® Upon publication, see jhttp : //irsa. Ipac ■ caltech. edu/| 
|Missions /cosinos .html 



sample of members is needed, we select galaxies with 
^'mcm > 0.5 in the inner regions of groups, R < 0.5i?200c, 
with stellar masses above our sample limit shown in Fig- 
ure [H 

In addition to the catalog described above using photo- 
metric redshifts, we have also produced a catalog replac- 
ing photo-zs with spectroscopic redshifts when available. 
We use the same selection algorithm and replace Viz) 
from the photo-z with a Gaussian of width equal to the 
typical uncertainty in zCOSMOS, az — 3.7 x lO"'*, and 
sample each distribution at intervals of 10~^ in redshift. 
This catalog has better purity and completeness than the 
photo-z catalog because of the improved redshift accu- 
racy, but the selection is less homogeneous because spec- 
troscopic sampling is not representative or complete. 

7. ANALYSIS AND DISCUSSION 

With the catalog of group members identifled and the 
purity and completeness of the sample characterized, we 
provide a first look at the properties of galaxies in these 
groups. Here we present an analysis of the colors of group 
members relative to the field. Future papers will study 
member properties in more detail, including galaxy mor- 
phologies, star formation rates, and AGN activity with 
respect to group properties like redshift, halo mass, and 
group-centric distance. 

Figure [9] sho ws the u n extinc ted rest-frame NUV-R col- 
ors from Ilbert et al.| (|2010j for group members. We 
use the clean sample of groups, selecting members with 
Pmcm > 0.5 within i?200c of the group center. The appar- 
ent banding in colors is due to the finite number of tem- 
plates used. We call galaxies identified as the MMGGscaic 
"centrals" with the other members as "satellites." We see 
a bimodal distribution in color space for both central and 
satellite galaxies, though redder colors are more common 
than blue for both types of members. In the margin plots, 
we show the distributions of colors and stellar masses for 
centrals and satellites, as well as field galaxies not as- 
signed to any X-ray detected group (Pmcm = 0). The 
field sample has been selected to match the redshift dis- 
tribution of group members in bins of dz = 0.1. Although 
the color distribution of field galaxies is also bimodal, it 
is clear that bluer galaxies are more common in the field 
than in groups, a well-known result in clusters and dense 
envir onments over a ra nge of mass scales and redshifts 
(e.g. , [Gerke et al.|[2007 1 . 

We also see that there exist a number of blue centrals, 
which are of interest because they suggest that star for- 
mation can persist or be reactivated in the centers of 
dense groups, or that AGN exist there. The set of red 
points in Figure [9] omits ambiguous cases where there is a 
more massive galaxy in the outskirts of a group or where 
the MMGGscaic differs between the photo-z-only catalog 
and the one supplemented with available spectroscopic 
redshifts; 79% of groups satisfying the quality cuts of §|6] 
have an unambiguous central according to these criteria. 
While the majority of this sample of centrals are red (77 
out of 102), there are 5 centrals with blue colors indica- 
tive of active star formation or AGN, and 20 centrals 
with colors indicating intermediate activity. This pop- 
ulation warrants further study to verify that they are 
accurate centers and to determine what environmental 
factors could contribute to the star formation or AGN 
activity. 
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Fig. 7. — Completeness (top row) and purity (bottom row) of the galaxy membership selection for hypothetical surveys with different 
redshift uncertainties according to the legend. Each curve represents the mean of ten mock lightcones. The fiducial survey is the same as 
that plotted in Figure[5]for COSMOS. Note that the completeness curves for cjz = 0.0004 and 0.005 lie atop one another. 



TABLE 2 

Basic catalog properties 



Property 



Value 



Field coordinates (J2000) 
Group redshift 
Galaxy magnitude 
Halo mass 
X-ray luminosity 

Ngroups 

Ngroupa (XFLAG = 1,2) 

Ngroups (clean groups'*) 

Nniem (^mcm ^ 0.5) 

Nniom(Pmcm > 0.5, clean groups) 

Nmom(Pmem > 0.5, < 0.5i?200c, 
Nmom(Pmem > 0.5, < 0.5i?200c, 



log(M,^/M0) > 10.3) 
log(M*/M0) > 10.3, clean groups) 



R,A=(149?4, 150?8), Dec=(l?57, 2?90) 

< 2G < 1 

F814W < 24.2 
12.8 < log(M2ooc/M0) < 14.3 
41.3 < log(Lx/erg s'^) < 44.1 
211 
165 
129 
4631 
3406 
868 
657 



XFLAG = 1, 2, MASK=POOR=MERGER=0 

7.1. Environmental Dependence from z = 0.2 to 1 

The higher fraction of blue galaxies in the field com- 
pared to groups shown in Figure [9] indicates that star 
formation is less common in dense environments. As dis- 
cussed in the introduction, much work has been carried 
out to determine whether this well-known effect is due 
to a physical process acting on galaxies in dense envi- 
roments to suppress their star formation rates, or due 
to the intrinsic properties of galaxies that exist in these 
regions. 

To distinguish between the possibilities of environmen- 
tal influence and innate differences, we compare galaxy 
colors in group and field environments within fixed stel- 
lar mass bins. We measure t he fraction of galaxies of 
the quiescent type defined by Ilbert et al. (2010), i.e. 
those with M{NUV) - M{R) > 3.5. These are unex- 
tincted rest-frame colors from the spectral template that 
best fits each galaxy's SED, allowing us to study intrin- 



sic colors that are related to specific star formation rates 
without the obscuring effects of dust. The correction is 
important because galaxies in low density environments 
have a higher dust conte nt, even among massive galaxies 
( [Kauffmann et al.||2004[ ). 

'I'he traction of red galaxies in fixed stellar mass bins 
is plotted for three redshift ranges in Figure 10 For 
group members, we consider only galaxies witn mem- 
bership probability Pmcm > 0-5 within a projected dis- 
tance of 0.5i?2ooc of the center of groups in the mass 
completeness-limited bins of Figure [T] The radial cut 
on the group sample i s to avoid contamination in the 
outskirts discussed in § 5.3 The plot includes both cen- 
trals and satellites; excluding centrals leaves the results 
essentially unchanged because the sample is dominated 
by satellites even in the highest stellar mass bin plot- 
ted. The fraction of red galaxies in this population is 
plotted against the mean stellar mass for each bin. We 
also plot the red fraction in individual groups for galax- 
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Fig. 8. — Completeness (top row) and purity (bottom row) of the galaxy membership selection for hypothetical surveys with different 
centroid uncertainties according to the legend. As in Figure [t] each curve represents the mean of ten mock lightcones, and the fiducial 
survey is the same as that plotted in Figure[5]for COSMOS. 

field sample plotted matches the the redshift bins used 
for group members; matching the redshift distribution 
on finer scales results in quenched fractions that are at 
most a few percent different from those plotted. 

We also see evidence for an increase in the red fraction 
with decreasing redshift among low mass group galaxies. 
Redshift trends are somewhat difficult to interpret be- 
cause the range of group masses used is different in each 
redshift bin (see Figure [T]) and the color cut does not 
account for evolution, so we leave a detailed analysis for 
future work. Because our galaxy sample is magnitude- 
limited, the low mass bins at high redshift are incomplete 
and plotted as open symbols. We make no corrections 
for incompleteness here, which likely leaves the sample 
in these bins biased toward the detection of blue galax- 
ies that tend to have lower mass-to-light ratios than red 
galaxies, so we consider the red fractions in incomplete 
bins to be lower limits. 

We detect a clear dependence of galaxy color on en- 
vironment, even at fixed stellar mass and high redshift. 
Figure 10 uses the member catalog determined with pho- 
tometric redshifts only, but including the available spec- 
troscopic redshifts does not affect the results. Our re- 
sults are similarly insensitive to the choice of a prob- 
ability threshold for membership; changing the cut on 
^mem > 0.5 to 0.3 Or 0.7 or weighting objects by Pmem 
instead of choosing a threshold moves the red fractions 
by no more than a few percent i n any bin. Using the color 
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Fig. 9. — Stellar masses and unextincted rest-frame template col- 
ors for central galaxies (red diamonds) and satellites (gray dots). 
We plot only the unambigous centrals, i.e. those designated as the 
MMGGscalc in groups which do not have a more massive galaxy 
in the outskirts or a discrepancy between identification with spec- 
troscopic and photometric redshifts. Horizontal d a shed b lack lines 
show the galaxy spectral classes of [Ilbert et al.| | |2010[ | . Margin 
plots show the distribution of colors and stellar masses tor centrals 
(red solid), satellites (gray dashed), and a redshift- matched sample 
of field galaxies (blue dotted), rescaled for comparison. Axis labels 
should be multiplied by 15 (centrals), 150 (satellites), and 500 (red- 
shift matched field sample) to obtain the normalized distributions. 
The lower end of the stellar mass range plotted corresponds to our 
completeness limit at 2: = 1. 

ies in the same stellar mass bins to show the variation 
between groups. At all redshifts, higher mass galaxies 
tend to have higher red fractions than lower mass galax- 
ies. In addition to this stellar mass dependence, we see 
a clear separation between the group and field popula- 
tions at all redshifts for the stellar masses probed. The 



cuts described in Bundy et al. (2010) to separate passive 
galaxies in COSiVluy from diasty star-forming galaxies 
also gives qualitatively similar results. Though the ab- 
solute fraction of red galaxies measured depends on the 
specific cuts used, the relative trends with stellar mass 
and environment are similar; galaxy groups have a larger 
proportion of red galaxies than the field, and groups dom- 
inated by blue galaxies are rare, though some appear to 
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Fig. 10. — Fraction of quenched galaxies as a function of stellar mass in different redshift bins (separate panels). Triangles show the 
quenched fraction of members with Pmom > 0.5 and projected group-centric distance within 0.5iJ200c of groups in the mass bins from 
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the arrows above these symbols indicate that they are likely to be biased lower than the true values. Vertical error bars show the standard 
deviation of 1000 bootstrap samples and horizontal bars represent the stellar mass bin widths. Gray circles show the quenched fraction in 
a given stellar mass bin for individual groups, with size proportional to the number of members in the bin. Note that the range of halo 
masses for group members varies between redshift bins. 



exist. 

We note that while our results confirm a significant 
relationship between color and environment out to the 
redshift limit of our sample, the cause of this relationship 
remains unknown. More detailed analysis distinguishing 
between physical processes happening before and after 
galaxies join groups is necessary to determine the role 
that groups play in this pr oces s. 

We now compare Figure 10 to results from the liter- 
ature. Our findings are consistent with results at low- 
redshift that show a decrease in star formation at high 
stellar masses, as well as a clear connection between 
star-formation and environment even after accounting 
for stellar mass diff erences (e.g. 



Baldry et al. 2006). At 



Kauffmann et al. 



0.47McGee et al. (2UTT 



2004 



report an environmental trend in the Gii)E(J survey that 
is smaller than a compa rable low-redshif t samp le, but 



suggest that the non-detection of environmental trends 
in other surveys is likely due to lower spectroscopic sam- 
pling rates and reliance on less confident redshifts which 
comprise a significant fraction of the sample at high red- 
shift. The photometric redshifts used in the present work 
are certainly less precise than spectroscopic redshifts, but 
have a much higher sampling density. 

An important distinction with this study is that we are 
using a unique sample of groups; X-ray detections en- 
sure a robust sample of structures that are virialized, a 
trait which may not be true of optically-selected groups. 
The X-ray groups are also more massive than the typical 
spectroscopically-selected groups, and may have formed 
earlier giving them a longer time to suppress star forma- 
tion in member galaxies. Other studies of this sample 
of X-ray groups also detect a significant environmental 
effect on galaxy c olors (Giodini et a l ., in p rep.; Tanaka 



still significant. However, Poggianti et al. (2008) do not et al., in prep.). Finoguenov et al. (2010) have shown 



detect a significant separation in the fraction of star- that the number density ot this X-ray group sample is in 



forming galaxies in cluster and field environments at 
2 = 0.4 — 0.8 after matching stellar mass distributions. 

At the highest redshift range covered by ou r group 
sampl e , ou r finding s are consis t ent w ith those of |Cooper| 
" 2010) and Peng et al.| |2010[ ) who detect a sig- 
i color-density relation m a similar stellar mass 
range covered by DEEP2 and zCOSMOS, respectively. 
These results appear to be at odds with some claims 
from VVDS and zCOSMOS analyses that such a rela- 
tion could be attributed solely to the existen ce of more 
massive ga laxies in dense environments (e.g., ^Scodeggio' 
|et al.,2009j|Cucciati et al.|2010|[Iovino et al.j2010| ). Each 
ot these studies consider the environmental effect on color 
at ffxed stellar mass, as we have done here. [Cooper 



et al. (20101 emphasize that systematics in the selection 
ot dense environments tend to wash out the measured 
signal of environmental dependence, so that if such cor- 
relations are seen they are likely to be real. They also 



reasonable agreement with that expected for halos of cor- 
responding masses within our cosmological model. The 
sample is therefore unlikely to be an extreme population 
of groups, though we cannot rule out subtle differences 
between X-ray-selected groups and the full population of 
groups in this mass range. 

Because of differences in analysis methods, it is difficult 
to determine whether the qualitatively distinct findings 
in these environmental studies are also quantitatively in- 
consistent, or whether different results are simply due to 
measuring different quantities. We now consider aspects 
of the analysis methods and definitions of environments 
that may contribute to the differing results, focusing our 
attention to field studies at z ~ 1 where the results ap- 
pear most discrepant. For instance, we have shown in 
Figure |8] that centroiding errors can infiuence the pu- 
rity ana completeness of a group sample, and our lens- 
ing tests (see Paper II) ensure a reliable determination 
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of the center of mass. Additionally, we use only galax- 
ies and groups in mass-complete bins, avoiding the need 
for volumetric corrections made in some of the previous 
analyses. 

Our group-based definiti on of dense en vironments is 
most similar to that of lovino et al. ( 2010, ), who studied 
optically selected groups which are less massive on aver- 
age than the X-ray groups studied here. We leave a de- 
tailed comparison between optically and X-ray selected 
groups in GOSMOS to a fut ure paper (Finog uenov et. 
al, in prep.), but Figure 22 of Kovac et al. (2010b) shows 
that these X-ray selected groups tend to be m more dense 
regions on average than the optically selected ones. The 
richest optical groups (with 4 or more members) show 
a good correspondence with X-ray groups in the density 
field and ar e more likely to h ave similar halo masses. 
Figure 12 of lovino et al. ( 2010) shows that the colors of 
stellar mass-selected samples do not significantly depend 
on group richness (except perhaps at low stellar mass 
and redshift), so the difference in halo masses between 
X-ray and optical groups is not obviously th e cause of dis- 
crepancy between our results and those of [lovino et al.| 
mm. ^Knobel et al.| ( [2C)09 ) tested the purity of the op- 



between these a nalyses. The fif t h nea rest neighbor es 



tical group sample with mock catalogs and found a con- 
tamination fraction of roughly 25%, improving to about 
15% for richer groups. These values are similar to our 
estimate of 16% contamination within 0.5i?2ooc for the 
sample used in our analysis, so the correct identification 
of group members is not a clear cause for the difference 
in results either. 



Gucciati et al. (20101 quantify environment using the 
local galaxy overdensity S = {p — p)/p, where p is the 
local density computed from the distance to the fifth 
nearest neighbor and p is the mean density at a given 
redshift. Their Figure 10 shows a red fraction that is 
roughly constant across quartiles of the local overdensity 
distribution for z > 0.5 and log(M^/M0) > 10.85, with 
a weak overdensity dependence at lo wer redshift . With 



the same data and density indicator, Peng et al. (20101 
study the density field p instead of the overdensity field 
S and show a significant color-density relation at z ~ 0.5 
and state that it continues to at least z = 1. They at- 
tribute the difference in results to the fact that a lower 
fraction of galaxies at z ~ 1 live in regions with high S, 
and so the highest overdensity quartile used in the anal- 
ysis of Gucciati et al. ( [2010 } is presumably too broad 
at z ~ 1 to isolate the small population of galaxies in 
regions dense enough to pr oduce strong environmental 



effects. Gooper et al. (2010) use a third nearest neighbor 



density estimator on DEEP2 data and identify a differ- 
ence in the distribution of colors between galaxies in the 
upper 10% and lower 50% of the density distribution, 
so perhaps a cut more stringent than the upper quartile 
of the density distribution is needed to detect environ- 
mental effects at z ^ 1. Referring again to Figure 22 of 



Kovac et al. (2010b I however, we see that X-ray groups 
live in roughly the same ra nge of overdensities a s the up- 
per quartile of S used in the Gucciati et al. ( 2010|) analysis 
(log(l -I- (5) > 1). Thus the explanation from Peng et al. 
(2010| for the non-detection of environmental trends at 
z ~ 1 by [Gucciati et al. (2010) is not obviously appli- 
cable since we see a clear environmental signal in this 
overdensity range. 
The scale on which environment is defined also differs 



timator used by Gucciati et al. (2010) and Peng et al 
(120101) and third nearest neighbor used by Gooper et al 
(pOlO) measure environment on a scale that varies with 
density and is typically of order 1 Mpc. IScodcggio et al.[ 
( j2009| ) measures a density field on a significantly larger 
scale of 8 Mpc, so correlations on the smaller scales of 
group halos may be washed out. The group sample in 
Figure [T0| is restricted to 0.5i?2ooc but the environmen- 
tal signaTis not significantly different when all selected 
members out to i?200c Siie included, despite the higher 
contamination fraction. In either case, -R200C is typically 
about 500 kpc for these groups, so our sample is likely 
probing the environment on smaller scales than studies 
using the galaxy density field. 

Differences in the colors used to identify galaxies as red 
or blue could also contribute to the contrasting results. 
The other studies discussed here typically use rest-frame 
U — B or B — I colors, sometimes with a mass or redshift 
dependent cut to account for varying populations. We 
use an extinction-corrected rest-frame NUV — R color 
to account for unquenched galaxies that appear red due 
to dust. To test the effect of this correction, we have 
tried redefining the sample of red galaxies using the cuts 
NUV — R > 3.5 or U — B > 1 without any extinction 
correction. The red fraction increases due to the influ- 
ence of dust redding, and the separation between group 
and field values at z > 0.5 is reduced by up to a factor 
of 2, but we still see a clear difference between the red 
fraction in group and field environments. 

We have not identified an obvious single factor to ex- 
plain why previous analyses did not detect an environ- 
mental effect on galaxy color, but suspect the cause to 
be a combination of factors mentioned above. To avoid 
confusion when discussing environmental effects and to 
properly detected these trends, it is clearly important to 
specify what is meant by "environment" and to measure 
it carefully. 

7.2. Star Forming Galaxies and the SZ Power Spectrum 

Recent high-resolution, ground-based experiments 
have probed the power spectrum of t he GMB to unprece 
dented fine scales (e.g., i > 2000; [Lueker et al.[[2010 



Fowler et al.|2010[|Shiro"koff et al.|20l0| [Das et al.pUTT ) 
which are sensitive to a variety of secondary anisotropies, 
such as radio and submillimeter point sources, and the 
SZ effect from groups and clusters. It was found that the 
power due to the SZ effect was 50% or less than predic- 
tions of most of the models ( Lueker et al.|2010 Dunkley 
et al.[[20l"0 1 , which could be indicative of our incomplete 
understanding of the properties of groups and clusters, 
especially the low mass (< IO^^Mq) systems at z > 0.5, 
as they are bel ieved to contrib ute half o f the SZ power at 
i « 3000 (e.K.,[Komatsu fc Seljak 2002lpiaw et al.|2010 



Trac et al. 



2011 ). It IS possible that the hot gas pressure 



profile of distant groups behaves differently from that of 
clusters or that cluster profiles deviate from expectations 
at large radii, although it was recently shown that local 
group s obey the "universal" pressure profile (Sun et al. 
2011 1 and at least one nearby cluster obeys th e profile out 
to it!2 nnc af ter accounting for gas clumping (Simionescu 



et al. 2011). Another possibility is that star formation 
(SI') activity is elevated in high-z systems, and the con- 
tribution of unresolved SF galaxies in the submillimeter 
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regime could fill in the SZ de crement at ~ 15 GHz, thus 
reducing the SZ power (e.g., Hall et aL]|2010 ). Using the 
z = 0.5 — 1.0 COSMOS groups, we are m a good position 
to investigate the contamination due to SF galaxies in 
groups. 

For each group, we cross-matched all candidate mcm- 
ber galaxies with the MIPS 24 jum sourc e catalog 



(Sanders et al. 2007 



matching radius ot 2 



Le Floc'h et al. | 
l^'or the matchec 



I| [2009[ ), using a 
ia objects, we as- 



sumed a sta rburst SEP (spanning from 3600A to 1 cm 



taken from Lagache et al. (2003), and compared the 



24 /im to 148 GHz flux ratio. More specifically, we ap 
proximated the MIPS 24 ^m band as a tophat spanning 
20.8 to 26.1 fj,m, and we used 18 GHz as the bandwidth 
for the 148 GHz channel of the Atacama Cosmology Tele- 
scope (ACT). Given the mass of our grou ps, we estimated 
the SZ flux for our groups, following Majumdar & Mohr 
( [2004^ . We find that the sum of the fluxes from SF galax- 
ies at 148 GHz is negligible (typically < 0.3%) compared 

to the magnitude of the SZ effect from these groups. 

We selected the SED from the set of templates of |La- 
gache et al. ( 2003 ) that maximizes the 148 GHz to 24 /im 
tlux ratio to set a conservative upper limit, but uncertain- 
ties in the spectral model could allow for a lar ger flux at 
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millim eter wavelengths from these sources. 
(20101 compare stacked MIPS measure ments at 24, 70, 
and 160 iim to empiric al templates from Chary & Elbaz' 
( [200T| ) ; IDale fc Helo u| f2002 ) ; Lagac he et al. (2003 ) and 
theoretical models from Sicbcnmorg en & Kriigelj ( [20ly7| ) . 
For the stacked MIPS fluxes of sources at z = 0.5 -- 1, tfie 



best-fitting template from Lagache et al. ( |2003 1 tends to 
predict a flux at wavelengths longer than 300 /iin that can 
be an order of magnitude lower than other models. Even 
accounting for these uncertainties in the spectral mod- 
els, our upper limit to the contamination of star-forming 
galaxies to the SZ signal from our sample of groups is no 
more than a few percent. 

Our group sample is limited to z < 1. Several studies 
have reported elev ated SF activity in centers of clusters 
at z > 1.4 



Hilton et 
IT 



al. 2010 Tran et al. 2010 



Tanaka et al.| 20l0r It remains to be seen if tne SF 
gafaxies coufd be abundant enough at higher redshifts 
to make a significant impact on the SZ signal in such 
systems. 

8. SUMMARY AND CONCLUSIONS 

We have presented a catalog of member galaxies in X- 
ray selected groups in the COSMOS field, carefully tak- 
ing into account photo-z errors and attempting to avoid 
biasing the sample with assumptions about the prop- 
erties of galaxies which have not previously been well- 
constrained in this mass and redshift range. We have 
thoroughly characterized the quality of the selection al- 
gorithm using tests with mock catalogs and spectroscopic 
redshifts. In these tests we discovered contamination 
from galaxies in projection in the outskirts of groups, 
but selection in the central regions is relatively clean. 
We have also studied the prospects for applying this se- 
lection algorithm to future multi- wavelength data sets, 
estimating the purity and completeness of the member 
selection as a function of redshift and centering uncer- 
tainties. 

Analyzing this sample of group members, we have 
shown that both stellar mass and environment play a role 



in determining galaxy colors at z '-^ 1. We emphasize 
that there are many ways to smear out environmental 
correlations and that these factors must be properly con- 
trolled in order to detect environmental trends. The X- 
ray groups studied here provide a clean sample of dense 
environments for which we can determine halo masses 
and centers. 

Following our finding of suppressed star formation in 
group environments at all redshifts sampled, we inves- 
tigated the possibility that clustered dusty star-forming 
galaxies could reduce the detected power in the high- 
£ CMB power spectrum by filling in SZ decrements in 
groups, and found that the effect must be quite small 
in this group sample. In contrast with the results relat- 
ing to the suppression of star formation in groups, we 
have identified several blue central galaxies, which war- 
rant further study. 

In Paper II of this series on our sample of galaxy groups 
and members, we will describe weak lensing tests used to 
optimize the centering by finding tracers that best locate 
the center of mass. Further work will analyze galaxy 
properties in groups with respect to the distance from 
these centers, providing constraints on models describing 
the evolution of galaxies in dense environments. With a 
carefully selected sample of member galaxies in groups 
with well-constrained masses and centers, we can hope 
to map the course by which galaxies transform. 
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